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Antigens and Uses Thereof 

The present invention relates to the use of antigens derived 
from the RDl or BD2 regions of the Mycobacterium tuberculosis, 
Mycobacterium bovis or Mycobacterium africanum genomes, and 
peptides derived thelrefrom, as diagnostic reagents, in 
particular in the context of diagnostic kits. In addition, 
certain of these peptides, as well as other antigens and 
peptides derived from the RD14 region of the genome are suitable 
for use as vaccines. Novel fusion peptides are also part of the 
invention. 



in particular, the p^^esent invention relates to diagnostic kits 
conprising such antigens for differentiating between those 
15 mammals infected by tuberculosis, those which have been 

vaccinated against tuberculosis, and those mammals, which have 
been sensitised by environmental Mycobacteria. 

The present invention further relates to novel Mycobacterium 
20 tuberculosis, Mycobacterium bovis and Kycobacterium afxicanmn 
peptides derived ftom such antigens. 

The present invention also relates to vaccines against 
Kycobacterium infections, in particular, Mycobacterium 
25 tuberculosis, Mycobacterium bovis or Mycoba cteriuin africanum 
infections, as well as to veterinary and pharmaceutical 
compositions containing these and their preparations. 

Bovine tuberculosis (BTB) is caused by Mycobacterium bovis and 
30 shares greater than 99.9% DNA identity with M. tuberculosis, the 
main cause of human tuberculosis. Moreover, BTB is a zoonotic 
disease and was responsible during the 1930s and 1940s for 
approximately 6% of the total human deaths due to TB, and more 
than 50% of all cervical lymphadentitis cases in children. The 
35 introduction of pasteurisation of milk in the 1930s dramatically 
reduced the transmission from cattle to man. However, it still 
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remains a small but significant cause of human morbidity and 
mortality especially in developing countries and is seen. as one 
of the most important infectious diseases of both man and other 
animals in the world- 

5 

Mycoha.c'teriiim hovis causes disease in both cattle and man. In 
the absence of a BTB control programme, TB in cattle can have 
severe iit^lications for animal welfare, causing reduced 
productivity and premature death, resulting in substantial 

10 econoinic losses to affected farms. A compulsory eradication 

programme based upon the slaughter of infected animals, detected 
by the single intradermal comparative tuberculin skin test, 
began in Great Britain (GB) in 1950 and by 1960 it had been 
implemented in all of Britain. These measures resulted in the 

15 dramatic reduction of bovine tuberculosis in GB from incidence 

rates of around 40% of cattle infected with H. hovis to 0.41% in 
1996. However, despite continued implementation of these 
control measures , the incidence of BTB in cattle has been 
steadily rising since 1988, possibly due to a wildlife reseocvoir 

20 of M. ±>ovis. 

BCG is an attenuated strain of M. hovis, and is currently the 
only available vaccine for the prevention of BTB. Encouraging 
results with BCG have been reported in New Zealand where a 
25 significant level of protection in BCG vaccinated cattle against 
experimental hovis infection has been recently demonstrated. 

Immunity to K.' -tuherculosis is characterised by three basic 
features: 1) living bacilli which efficiently induce a 
30 protective immune response; 2) specifically sensitised T 

lymphocytes which mediate this protection, and 3) interferon 
gamma (IFN-y) which is an important mediator molecule. 

Cattle with a mycobacterial infection will predominantly mount a 
35 cellular immune response. Therefore, the skin test using 
tuberculin PPD has become an integral part of the bovine 
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tuberculosis eradication prograinme. In addition to skin tests, 
blood-based diagnostic assays that measure antigen-induced 
lymphokine production such as the IFN-y are also under 
consideration. The cytokine ITO-y appears to be critical in the 
5 development of immunity to M. tuberculosis. For example, both 
mice with a disruptive IFN-y gene and hiamans with mutated IFN-y 
receptor are highly susceptible to mycobacterial infections. 
However, specificity constraints are associated with the use of 
PPD in such assays. These arise due to the crude mixture of M. 
10 jbovis proteins that it contains, many that are cross-reactive 
with other environmental mycobacterial species, e.g., avium 
or M. introLcellnlaire and iirportantly the vaccine strain H. bovis 
Bacille Calmette-Guerin (BCG) . 

15 A cattle vaccine would reduce the risk of cattle infection and 
hence result in lower t\:ibercxilin test frequencies and 
significant cost savings. It is believed that the development 
of an improved cattle vaccine holds the best long-term prospect 
for BTB control in British herds. In addition, it would be 

20 desirable to develop a complimentary diagnostic test to 

differentiate between vaccinated animals and those infected with 
M. bovis [differential diagnosis) in parallel with the vaccine 
to ensure continuation of the test and slaughterrbased control 
strategies . 

25 

As previous studies have demonstrated, diagnostic reagents which 
distinguish between vaccinated and infected cattle can be 
developed using specific, defined antigens that are present in 
virulent bovis but absent from the vaccine strain. Genetic 

30 analysis of BCG has revealed that several large genomic regions 
have been deleted during attenuation and s\absequent prolonged 
propagation in culture [Behr, M. A., et al- . 1999. Science 
284:1520 - 1523; Gordon, S. V., et al. 1999. Mol. Microbiol. 
32: 643-655]. These regions have been characterised and 

35 antigens from one of these regions, RDl [Mahairas, G. G., et al. 
1996. J. Bacteriol. 178:1274-82], have been studied extensively 
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in several species including hxjmans and cattle- For example, it 
Inas been recently demonstrated that protein or peptide cocktails 
composed of two RDl region antigens, ESAT-6 and CFP-10/ can be 
used to distinguish between BCG vaccinated and M. bovis infected 
5 cattle [Van pinxteren, et al-- 2000. Clin. Diagn. Lab. 
Immunol. 7:155-160; Vordermeier, H. M. et al. 2001. Clin. 
Diagn. Lab. Immunol. 8:571-8]. 

However, the level of sensitivity achieved with these antigens 
10 has not reached that of tiobercuiin. It would, therefore, be 
desirable to provide other antigens in order to achieve this 
desired sensitivity- Such antigens may also be useful in 
supplementing the ESAT-6 and CFP-10 to achieve even greater 
sensitivity. 

15 

In alternative approach to using recombinant proteins is the 
application of overlapping synthetic peptides derived from those 
antigens described above. Synthetic peptides have the 
advantages of lower production costs, easier standardisation, 
20 improved c[uality control and carry no risk of infection since 
they are chemically synthesised. 

Such synthetic peptide epitopes have been found to have great 
potential in the study of immune responses in cattle and in the 

25 development of diagnostic reagents. For example, formulation of 
10 synthetic peptides derived from ESAT-6 and CFP-10 resulted in 
similar cellular immune responses to those seen with the whole 
recombinant antigens. When assayed in cattle this cocktail 
could distinguish between M. Jbovis infected animals ^nd BCG 

30 vaccinated cattle with sensitivity similar to PPD and with a 
greater specificity [Vordermeier, 2001 supra.]. 

Differential diagnosis is not the only concern associated with 
BCG. BCG vaccination studies have highlighted the variability 
35 with regard' to its efficacy. In humans, this ranges from 0 to 

80% when tested in different popiolations, with consistently poor 
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results observed in the equatorial regions. Similar variations 
in efficacy have also been reported in BCG vaccination 
experiments and trials in cattle (e.g. [Buddie, B., et al, 
2002. Vaccine 20: 1126-33). It would therefore be desirable to 
improve or supplement BCG vaccination. Strategies to generate 
novel tiiberculosis vaccines include sub-unit vaccination with 
either DNA vaccines or protein subunits (Rev. [Anderson, P. 
2001, TB Vaccines: progress and problems. Trends Immunol]). 
Antigens such as MPT-64 and ESAT-6, whose genes were deleted in 
BCG, have been tested as DNA vaccines and imparted protective 
immunity in small animal models. 



The present invention seeks to provide an improved diagnostic 
test to differentiate between vaccinated mammals and those 
15 infected with tuberculosis. Preferably, the test of the present 
invention differentiates between animals vaccinated against 
Mycobacterium bovlsr Mycohacterlvm tuberculosis or Mycobacterium 
afrlcanum and those infected with Mycobacterium bovlsr 
Mycobacterium tuberculosis or Mycobacterium afrlcanum. 

20 

The present invention also seeks to provide an improved vaccine 
for control of tuberculosis and in particular to control 
tioberculosis in cattle. The tuberculosis disease also affects a 
number of other different animal species such as guinea pigs, 
25 badgers, possums and deer. The vaccines of the present 

invention may therefore be useful in* the control of tuberculosis 
infections in such different animals. 

The applicants have found that certain polypeptides derived from 
30 the RDl or RD2 regions of the KycoJbacterium tuberculosis^ 

Mycobacterium bevels or Mycobacterium afrlcanum genomes, or a 
variant, homologue or fragment of these, other than ESAT-6, CFP- 
10, MPT-64 are useful as diagnostic agents, and in particular 
are a source of diagnostic peptides. Suitably the polypeptide 
35 is other than polypeptides encoded by the Rvl984c, Rv3871, 

RV3872 or at least certain parts of the Rv3873 regions of the 
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Mycohacterium tuherculosxs r Mycobactexlum i>ovis or Mycohac-terium 
afxicanum genomes. 

The term ''^polypeptide'' as used herein includes long chain 
5 peptides including proteins and epitopic fragments thereof. 
Such proteins generally comprise one or more chains of amino 
acids joined covalently through peptide bonds and are typically 
greater than 10,000 MW. 

z 

10 The term ^^peptides'' refers to small proteins . (generally less 

than about 10,000 MW) , and in particular to smaller chains, for 
example up to 30 amino acids in length, preferably up to 20 
amino acids in length. Also included however are small 
oligopeptides comprising three or more amino acid residues 

15 covalently linked through peptide bonds. Peptides will 

generally coir$>rise two or more amino acid residues linked 
together covalently through peptide bonds. 

The polypeptide from which the diagnostic agents are selected 
20 are preferably encoded by the Mycohacterium tuberculosis genome 
and coir^rise a member of the PE/PPE protein family. 

The term ^^derived from" as used herein means any polypeptide or 
peptide encoded by an open reading frame from the specified 
25 regions of the Mycohacizex-ium -tuberculosis^ Mycobacterium bovis 
or Mycobacterium africanum genomes - 

In particular,' the applicants utilise peptides encoded by 
fragments of the open reading frames and variants thereof as 
30 long as such peptides are still capable of being used as 

diagnostic reagents. In particular, they will comprise epitopic 
sequences . 

According to the present invention there is provided a 
35 diagnostic reagent comprising a peptide comprising an epitope 
from at least one polypeptide selected from Rvl986 (SEQ ID NO 
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1), RV3878 (SEQ ID NO 3), Rv 1983 (SEQ ID NO 4), Rv3873 (SEQ ID 
NO 5) or Rv3879 (SEQ ID NO 6) . 

An ^''epitope" of the sequence comprises those amino acids that 
5 are necessary to generate in immune response, and therefore be 
recognised in a diagnostic test. They may be consecutive amino 
acids, or they may be spaced apart from one another- In the 
latter case, the natirre of the amino acids between the amino 
acids of the epitope may not be crucial to the activity and may 

10 be varied- Determination of the amino acids which comprise the 
epitopes can be determined using routine methods, for example by 
finding antigenic regions or fragments,, as illustrated 
hereinafter, and then carrying out a series of small mutations, 
for example, point mutations, and then determining whether the 

15 immunogenic or diagnostic activity has been retained. Where it 
has, then the variant retains the epitope. If activity has been 
lost, then the mutation has disrupted the epitope and so must be 
reversed - 

20 Suitably, the peptide comprises a series of consecutive amino 

acids from within the polypeptide sequence. In one embodiment, 
the polypeptide from which the peptide is derived comprises the 
sequence shown in SEQ ID Nos 1, 3, 4 or 6. 

25 Alternatively, the polypeptide from which the peptide is derived 
comprises the sequence shown in SEQ ID Nos 3, 5 or 6. 

Particular examples of diagnostic reagents comprise peptides 
which include an epitope from SEQ ID NO 23 as shown in Figure 9 
30 hereinafter, which is a fragment of SEQ ID NO 5, and in 

particular, an epitope found within SEQ ID NO 25 or SEQ ID NO 7, 
or within SEQ ID NOs 28 and 29. SEQ ID NO 23, and fragments 
thereof, such as SEQ ID NOS 7,25, 28 and 29 form particular 
embodiments of the diagnostic reagents of the invention - 
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Other particular examples of diagnostic reagents comprise 
peptides which include an epitope from SEQ ID NO 35, shown in 
Figure 10 hereinafter, which is a fragment of SEQ ID NO 3- 
SEQ ID NO 35 or fragments thereof, form a particular embodiment 
5 of the invention- 

Particular examples of diagnostic reagents comprise peptides 
which include an epitope from SEQ ID NO 48, shown in Figure 11 
hereinafter, which is a fragment of SEQ ID NO 6, and in 
10 particular, an epitope found within SEQ ID NO 51. SEQ ID NO 48 
or variants thereof, or fragments of these, in particular SEQ ID 
NO 51 form particular embodiments of the diagnostic reagents of 
the invention- 

15 The polypeptides themselves, as well as homologues or variants 
thereof may also be used as diagnostic agents, but preferably 
fragments (comprising peptides are employed) . 

The peptides described above may be used in either specific or 
20 differential diagnostic tests. 

The term '^^fragment thereof" as used herein in relation to an 
amino acid sequence refers to any portion of the given amino 
acid sequence which has the same activity as the complete amino 
25 acid sequence. Fragments will suitably comprise at least 10 and 
preferably at least 20 consecutive amino acids from the basic 
sequence. In one embodiment, the fragment sequence comprises 17 
amino acids - 

30 Fragments of the polypeptide include deletion mutants and 

polypeptides where small regions of the polypeptides are joined 
together. The fragments should contain an epitope, and 
preferably contain at least one antigenic region. 
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The term "homologue'' refers to similar genes found in other 
organisms, such as the J^ycojbacterium Jbovis or KycoJbacterium 
afrlcanum genomes, 

5 The term variant thereof as used herein in relation to an 

amino acid sequence means sequences of amino acids which differ 
from the base sequence from which they are derived in that one 
or more amino acids within the sequence are siabstituted for 
other amino acids- Amino acid substitutions may be regarded as 
10 ^^conservative" where an amino acid is replaced with a different 
amino acid with broadly similar properties. Non-conservative 
substitutions are where amino acids are replaced with amino 
acids of a different type- 

15 By '"conservative substitution" is meant the substitution of an 
amino acid by another one of the same class; the classes being 
as followii: 

CLASS EXAMPliES OF AMINO ACID 

Nonpolar: Ala, Val, Leu, lie. Pro, Met, Phe, Trp 

2 0 Uncharged polar: Gly, Ser, Thr, Cys, Tyr, Asn, Gin 

Acidic: Asp, Glu 

Basic: l^ys, Arg, His 

As is well known to those skilled in the art, altering the 
primary structure of a peptide by a conservative substitution 

25 may not significantly alter the activity of that peptide because 
the side-chain of the amino acid which is inserted into the 
sequence may be able to form similar bonds and contacts as the 
side chain of the amino acid which has been siabstituted out. 
This is so even when the substitution is in a region, which is 

30 critical in deteirmining the peptides conformation. 

Non-conservative substitutions are possible provided that these 
do not interrupt with the antigenicity of the polypeptide - 
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Broadly speaking, fewer non-conservative siibstitutions will be 
possible without altering the biological activity of the 
polypeptide. Suitably, variants will be at least 50% identical, 
60% identical, preferably at least 75% identical, and more 
5 preferably at least 90% identical to the base sequence. 

Identity in this instance can be judged for. example using the 
algorithm of Lipman-Pearson, with Ktuple:2, gap penalty: 4, Gap 
Length Penalty: 12, standard PAM scoring matrix (Lipman, D.J. and 
10 Pearson, W.R., Rapid and Sensitive Protein Similarity Searches, 
Sciei3ce, 1985, vol. 227, 1435-1441). 

The applicants have found that a diagnostic test based upon SEQ 
ID NOS 1 and 3 or homologues or variants thereof, can give a 

15 differential diagnostic test, which in particular, can 

differentiate between tuberculosis-infected and tuberculosis 
vaccinated mammals . Selection of a diagnostic reagent 
comprising or derived from these sequences can be made so that 
they differentiate between Mycohacterlim JDoviSr Mycohacterlum 

20 tuheTculosls or J^ycojbacterlmn atrlcanum -infected mammals and 
mammals vaccinated # against Mycobacterium hovisr MycoJbactejrium 
tuherculosis or Mycobacterium africanum. 

Alternatively, the applicants have found that diagnostic tests 
25 based upon SEQ ID Nos 3 or 6, or a homologue or variant thereof, 
• can also provide differential diagnostic tests, but in this case 
they are able to distinguish between mammals, which are either 
vaccinated against or infected by tuberculosis and mammals, 
sensitised by environmental mycobacteria- The diagnostic 
30 reagent used in a specific diagnostic test preferably 

differentiates between Mycobacterium bovis -infected and mammals 
sensitised by environmental mycobacteria. 

Thus the applicants have found a range of diagnostic peptides 
35 derived from an RDl or RD2 region of the Mycobacterium 

tuberculosis r Mycobacterium bovis or Mycobacterium africaixum 
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genomes, or a variant, homologue or fragment of these, which are 
additional to those .derived from the ESAT-6 or CFP~10 
polypeptides • 

5 These peptides are capable of being used as diagnostic reagents 
and are preferably synthetic peptides having the advantages 
discussed above. 

One such peptide is a peptide derived from SEQ ID NO. 5, which is 
10 shown in Figure 6 as SEQ ID NO 7. Fragments, homologues and 

variants of this peptide are also included herein. The peptide 
as shown in SEQ' ID NO 7 may be used in a specific diagnostic 
test to differentiate between those mammals, which are either 
vaccinated against or infected by tuberculosis, and those 
15 mammals which have been sensitised by environmental 

mycobacteria. In particular, the peptide is especially useful 
in differentiating between MycohactBrlum jbovis- infected mammals, 
such as cattle or calves, and those animals sensitised by 
environmental bacteria* 

20 

Such peptides may be used as diagnostic reagents, either on 
their own or preferably with one or more other peptides, which 
may be other peptides according to the present invention, or 
different peptides, in order to achieve greater sensitivity and 

25 specificity of a diagnostic test. For example, protein or 

peptide cocktails composed of other antigens from the RDl or RD2 
regions of the Mycobacterium hovlsjr Mycobacterima tiiberculosds 
or MycohactGTima africanum genomes may be used in addition to 
the antigens of the present invention in order to enhance the 

30 specificity of the diagnostic test. In particular, peptide 

cocktails may include peptides derived from the antigens, ESAT-6 
and CFP-10, as well as those described above. 

According to a ftirther aspect of the present invention there is 
35 provided a diagnostic kit comprising at least two diagnostic 
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reagents, at least one of which is a diagnostic reagent as 
described above. 

In particular, kits of the invention will comprise polypeptides 
5 or peptides, at least one of which is selected from a 

polypeptide derived from the sequences shown as SEQ ID Nos 1, 3, 
4, 6 and 7, or a fragment, homologue or variant thereof, and 
optionally at least one polypeptide derived from the sequences 
shown as SEQ ID Nos 2 and 5, and optionally one or more 
10 reagents. Such kits may be used to differentiate between 
tuberculosis-infected and tuberculosis-vaccinated mammals . 

The polypeptide and peptide sequences described herein. can 
provide a means for detecting the recognition of the 

15 polypeptides or peptides by the T-cell- Preferably, diagnostic 
reagents utilised in the diagnostic kit are able to 
differentiate between MycoJbacteTXvm ho^rlSr Mycobacterium 
tuherculosxs or Mycohstcterlum afrlcanvm -infected mammals and 
mammals vaccinated against Mycohact&Tlum hovisr Mycobacterium 

2 0 tuberculosis or Mycobacterium afrlcanum. 

Where the kit is intended to be used to differentiate between 
those mammals infected by Mycobacterium bovls^ Mycobacterium 
tuberculosis or Mycobactrlum afrlcani2m and those mammals which 

25 have been vaccinated against Mycobacterium bcvls, Mycobacterium 
tuberculosis or Mycobacterium afrlcanum,r the kit will preferably 
con^rise the polypeptides derived from the sequences shown as 
SEQ ID Nos 1, 2 and 3, or a fragment, homologue or variant 
thereof- In particular, it will comprise diagnostic reagents 

30 coitprising peptides comprising an epitope from these sequences - 

Where the kit is intended to be used to differentiate between 
those mammals infected by Mycobacterium bovls^ Mycobacterium 
tuberculosis or Mycobactrlum afrlcanum and mammals sensitised by 
35 environmental mycobacteria, the )cit will preferably comprise 

polypeptides or peptides derived from the sequences shown as SEQ 
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ID Nos 4, 5, 6, and optionally, SEQ ID NO. 7 or a fragment, 
homologue or variant thereof. In particular, it will comprise 
diagnostic reagents coinprising peptides comprising an epitope 
from these sequences. 

5 

The diagnostic kit may also comprise one or more polypeptides or 
peptides from the RDl region of the Mycohactorivm l>oviSr 
MycohactGrium tuberculosis or HycoJbacteriim a^ricanum genomes. 
Protein or peptide • cocktails composed of such polypeptides may 
10 also be used. Especially preferred are peptide cocktails 

composed of the ESAT-6 and/ or the CFP-10 polypeptides. Such 
peptide cocktails may be used to enhance the sensitivity of the 
diagnostic tests of the present invention. 

15 AS would be understood, the polypeptides and peptides described 
above are encoded by nucleic acids. Novel nucleic acids, for 
example which encode novel peptides or polypeptides as described 
above form a further aspect of the invention, together with 
fragments homologues or variants thereof- 



20 



25 



The nucleic acid may be DNA or RNA, and where it is a DNA 
molecule, it may comprise a cDNA or genomic DNA. These nucleic 
acids may themselves be useful as vaccines and such vaccines 
form a further aspect of the present invention. 

Preferably, the nucleic acid comprises the sequence shown in SEQ 
ID Nos 8, 10, 11 or 13, or a variant or fragment thereof. 



The term ^"fragment thereof" as used herein in relation to a 
30 nucleic acid or polynucleotide sequence refers to any portion of 
the given polynucleotide sequence which exhibits the same 
activity as the complete pol:in:iucleotide sequence. Fragments 
will suitably comprise at least 15, preferably at least 30 and 
more preferably at least 60 consecutive bases from the basic 
35 sequence. 
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The term ^''variant thereof" in relation to a polynucleotide or 
nucleic acid sequences means any siabstitution of, variation of, 
modification of, replacement of deletion of, or the addition of. 
one or more nucleic acid(s) from or to a polynucleotide sequence 
5 providing the resultant protein sequence encoded by the 

polynucleotide exhibits the same properties as the protein 
encoded by the basic sequence. The term therefore includes 
allelic variants and also includes a polynucleotide which 
siabstantially hybridises to the polynucleotide sequence of the 

10 present invention- Preferably, such hybridisation occurs at, or 
between low and high stringency conditions. In general terms, 
low stringency conditions can be defined as 3 x SSC at about 
ambient temperature to about 5 5^C and high stringency condition 
as 0.1 X SSC at about 65^C. SSC is the name of the buffer of 

15 0.15M NaCl. 0.015M tri-sodium citrate. 3 x SSC is three times 
as strong as SSC and so on. 

Typically, variants have 62% or more of the nucleotides in 
common with the polynucleotide sequence of the present 
20 invention, more typically 65%, preferably 70%, even more 

preferably 80% or 85% and, especially preferred are 90%, 95%, 
98% or 99% or more identity. 

When comparing nucleic acid sequences for the purposes of 
25 determ±ning the degree of identity, programs such as BESTFIT and 
GAP (both from Wisconsin Genetics Computer Group (GCG) software 
package) . BESTFIT, for example, compares two sequences and 
produces an optimal alignment of the most similar segments. GAP 
enables sequences to be aligned along their whole length and 
30 fins the optimal alignment by inserting spaces in either 
sequence as appropriate- Suitably, in the context of the 
present invention when discussing identity of nucleic acid 
sequences, the comparison is made by alignment of the sequences 
along their whole length. 

35 
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Generally speaking, diagnosis of infection in a host, or 
exposure of a host, to a mycobacterium can be carried out by 
i) contacting a population of cells from the host with a 
polypeptide derived frbm an RDl or RD2 region of the 
5 MycobactBTdum tubsrculosis , Mycobacterium bovis or Mycobactorlim 
afjricanum genomes, or a variant, homologue or fragment of these, 
which polypeptide may be used as a diagnostic reagent, including 
those described above, in addition to those derived from ESAT-6, 

CFP-10, MPT-64; and 
10 ii) determining whether the cells of said cell population 
recognise the polypeptide or fragment or variant thereof. 

Thus in accordance with the invention there is provided a method 
of diagnosising infection in a host, or exposure of a host, to a 
15 mycobacterium, said method comprising 

i) contacting a population of cells from the host with a 
diagnostic reagent according to the invention; and 

ii) determining whether the cells of said cell population 
recognise the diagnostic reagent. 



20 



Suitable diagnostic reagents are as described above 



The population of cells used in the method is siaitably a 
population of T-cells. The method preferably diagnoses 
25 infection by Mycobacterium tuberculosis, Mycobacterium bovis or 
Mycobacterium africanum. 

The diagnostic reagents and peptides described above can be 
used, in accordance with a further aspect of the invention to 
30 produce an antibody specific to the peptide. 

Polypeptides of the invention may be isolated from strains of M. 
bovis, M. tuberculosis or M. africanum. Preferably, they are 
prepared synthetically using conventional peptide synthesisers. 
35 Alternatively, they may be produced using recombinant DNA 

tecJinology or isolated from natural sources followed by any 
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chemical modification, if required. In these cases, nucleic 
acids encoding the polypeptides are incorporated into suitable 
expression vectors, which are then used to transfoirm a suitable 
host cell, such as a prokaryotic cell such as E. coll* The 
5 transformed cells are cultured and the polypeptide isolated 

* 

therefrom. Vectors, cells and methods of this type form further 
aspects of the present invention. 

A particular diagnostic kit comprising the polypeptides derived 
10 from the sequences shown as SEQ ID Nos 1 to 3 and further 

comprising one or more polypeptides derived from the RDl region 
of J^ycojbacteri 12222 tuberculosis^ Mycohacterli2m bovls or 
i^ycoJbacteriujn afrlcanum,^ and optionally one or more reagents, 
for differentiating between cattle infected by hovls and 
15 cattle which have been vaccinated with BCG or with a vaccine 
according to the present invention - 

A specific diagnostic kit comprises the polypeptides and 
peptides derived from the sequences shown as SEQ ID Isfos 4 to 7 

20 and further comprising one or more polypeptides derived from the 
RDl region of MycoJostcterlum tuberculosis jr Mycobacterium bovls or 
Kycojbacteriujn afrlcanumr and optionally one or more reagents, 
for differentiating between cattle which have either been 
vaccinated against or infected by H. bovls and those cattle 

25 which have been sensitised by environmental mycobacteria. 

A further preferred embodiment of the present invention is a 
vaccine comprising a peptide having the sequence shown in SEQ ID 
Wo 7. 

30 

An advantage of the present invention is that the level of 
sensitivity achieved in diagnostic tests with these antigens is 
higher than the sensitivity achieved with the antigens ESAT~6 
and CFP-10- In addition, the level of specificity of the 
35 antigen of the present invention is higher than that of PPD, 
which is currently used. PPD has the disadvantage that it 
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cross-reacts with other environmental mycobacterial species and 
the vaccine strain H. Jbovis Bacille Calmette-Guerin (BCG) . Such 
diagnostid tests will enable the transfer from skin testing 
regimes to vaccine regimes to be implemented. 

5 

A further advantage of the present invention is the provision of 
a test which can distinguish between those mammals that have 
been vaccinated against tuberculosis, and in particular M. 
hovlsr and those which have been infected with M. hovels. This 
10 allows the selective slaughter of animals which would appear 

from current tests to all be infected, thereby saving the lives 
of many animals. 

The applicants have also found that certain diagnostic reagents 
15 as described above, as well as polypeptides from which they are 
derived as well as some additional polypeptides, such as those 
shown in Figure 7 and SEQ ID NO 14 and 15 respectively, as well 
as variants and fragments thereof, produce a protective immune 
response, and therefore may be used as vaccines. 

20 

Thus tbe invention' f\irther provides a polypeptide comprising any 
one of SEQ ID NO 1, 2, 3, 4, 5 or 6, or variants thereof, or 
fragments of any of these, which produce a protective immiane 
response in a mammal to whom they are administered, for use as a 
25 vaccine. 

In addition, the present invention provides a polypeptide 
derived from an RD2 or RD14 region of the Mycoi^acterium 
tuherculosis ^ MycohactBrlum ho^ls or Mycohact&xlum afrlcanum 
30 genomes, or a variant thereof, or a fragment of any of these, 
for use as a medicament, with the proviso that the polypeptide 
is not a MPT-64 polypeptide or a polypeptide encoded by the 
Rvl984c region of the Mycobacterium tubBTculosiSr Mycohacterium 
hovls or Mycoi>actBr±rm afrlcanum genomes. 

35 
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These polypeptides, or variants or fragments, are preferably 
used as a vaccine against tuberculosis caused by Mycobacterium 
tuhGXculosis r Mycohacterium 2>ovis or Mycohact&rlum africanum. 

Suitably the polypeptide of the invention is derived from the 
MycohactGrium -tuherculQsis genome - 

In particular the polypeptide of the invention comprises the 
sequence shown in SEQ ID Nos 14 or 15, or a variant thereof or 
fragment thereof. Most preferably the polypeptide is of SEQ ID 
NO 14 or 15 or an epitopic fragment thereof. 

In particular, the invention provides a polypeptide comprising a 
fusion of a region of SEQ ID NO 14 and a region of SEQ ID NO 15, 
which fusion polypeptide is able to produce a protective immune 
response in a mammal to which it is administered. Particular 
examples of fusion polypeptides are illustrated hereinafter, as 
SEQ ID NOs 18, 20 and 22, and these, together with protective 
variants and fragments thereof form preferred embodiments of the 
invention. 

Polypeptides which 'are protective are protective against 
tuberculosis infection and therefore may be used as a 
prophylactic or therapeutic vaccine, and these form a fiirther 
aspect of the invention. 

Therefore, particular vaccines comprise a polypeptide derived 
from an RD2 or RD14 region of the MycoJbacteriuxn tuhsrculosds r 
Mycohacterima jDOvis or J^ifycojbacterium afxxcanum genomes, or a 
variant thereof or a fragment of any of these, which polypeptide 
produces a protective immune response against tuberculosis 
infection in a mamnnal to which it is administered, with the 
proviso that the polypeptide is not a polypeptide or a 

polypeptide encoded by the Rvl984c region of the MycohactBxium 
tuherculosiSr MycoJ^acterium hovis or Mycohacterium africanum 
genomes. 
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The vaccine is preferably used to vaccinate against 
tuberculosis. It may be used as a vaccine against tuberculosis 
in humans, cattle and other mammals including guinea pigs, 
badgers, possums and deer. It is, however, preferably used as a 
5 vaccine in cattle. 

Preferably, the vaccine comprises one or more protein subunits. 

Alternatively, it may comprise a nucleic acid such as a DNA or 
10 cDNA encoding for the protein or protein subunits. 

When it conprises a nucleic acid, this is suitably incorporated 
into an expression vector, in such as way that the protein 
subunit is expressed in the host animal. For example, the 

15 nucleic acid may be incorporated into a virus vector such as a 
vaccinia or adenovirus vector, or a plasmid to form a so-called 
^^naked DNA" vaccine- The vector may contain the usual 
expression control functions such as promoters, enhancers and 
signal sequences, as well as selection markers in order to allow 

20 detection of successful transf oirmants . The nature of these will 
depend upon the precise nature of the vector chosen and will be 
known to or readily determinable by a person skilled in the art. 

Preferably, vaccine coirpositions will further comprise an 
25 adjuvant such as in order to enhance the immune response to the 
biologically active material administered. Suitable adjuvants 
include pharmaceutically acceptable adjuvants such as Freund' s 
incomplete adjuvant, alviminium compounds and, preferably 
adjuvants which are known to up-regulate mucosal responses such 
30 as CTB, the non- toxic pentameric B subunit of cholera toxin 
(CT) . 

According to a further aspect of the present invention, there is 
provided a nucleic acid encoding a polypeptide of the invention 
35 or a fragment or variant thereof. The nucleic acid may be DNA 

or RNA and where it is a DNA molecule, it may comprise a cDNA or 
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genomic DNA. These nucleic acids may themselves be useful as 
vaccines . 

Preferably, the nucleic acids of the present invention are those 
5 shown as SEQ ID Nos 59 and 60, as well as or a variant or 

fragments thereof. One such variant, which encodes a fusion is 
SEQ ID NO 17. 

According to a yet a fiarther aspect of the invention, there is 
10 provided a pharmaceutical or veterinary composition comprising a 
protective polypeptide as described above, or a nucleic acid 
which encodes this, in combination with a pharmaceutically or 
veterinarily acceptable carrier. 

15 The carriers may be solid or liquid as understood in the art. 
They may be obtained by conventional procedures using 
conventional pharmaceutical excipients, well known in the art. 

In particular, the compositions of the invention may be in a 
20 form suitable for oral use (for example as tablets, lozenges, 
hard or soft capsules, aqueous or oily suspensions, emulsions, 
dispersible powders or granules, syrups or elixirs), for 
administration by inhalation (for example as a finely divided 
powder or a liquid aerosol), for administration by insufflation 
25 (for exaitple as a finely divided powder) or for parenteral 
administration (for example as a sterile aqueous or oily 
solution for intravenous, subcutaneous, intramuscular or 
intramuscular dosing or as a suppository for rectal dosing. 

30 The pharmaceutical or veterinary compositions are preferably in 
the form of a sterile injectable aqueous or oily suspension, • 
which may be f ojnnulated according to known procedures using one 
or more of the appropriate dispersing or wetting agents and 
suspending agents, which have been mentioned above. A sterile 

35 injectable preparation may also be a sterile injectable solution 
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or suspension in a non-toxic parenterally-acceptable diluent or 
solvent, for example a solution in 1,3-butanediol. 

Where the compositions of the invention comprise a nucleic acid, 
5 they are preferably formulated for parenteral administration and 
in particular intramuscular injection, although other means of 
application are possible as described in the pharmaceutical 
literature, for example administration using a Gene Gun, 
(Bennett et al., (2000), Vaccine 18, 18.93-1901). Oral or intra- 
10 nasally delivered formulations are also possible. Such 

formulations include delivery of the plasmid DNA via a bacterial 
vector such as species of Salmonella or Listeria (Sizemore et al 
(1997). Vaccine 15, 804-807). 

15 Formulation techniques generally are well known and are 
described for example in Chapter 25.2 in Voliame 5 of 
Comprehensive Medicinal Chemistry (Corwin Hansch; Chairman of 
Editorial Board), Pergamon Press 1990. 

20 The amount of active ingredient that is combined with one or 

more excipients to. produce a single dosage form will necessarily 
vary depending upon the host treated and the particular route of 
administration. Dosage unit forms will generally contain about 1 
mg to about 500 mg of an active ingredient. 



25 



30 



The size of the dose for therapeutic or prophylactic purposes of 
the composition of the invention will naturally vary according 
to the age and sex of the animal or patient and the nature of 
the active component and the route of administration, according 
to well known principles of medicine. Generally speaking 
however, for administration to a human as a prophylactic 
vaccine, dosage units of from 0.25 jag to 2.5mg will be typically 
employed * 

35 In yet a fiorther aspect, the invention provides a method of 
protecting a mammal against infection against Mycobacterium 
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Jbovis, MycoJbacterium tuhBrculosis or MycohactBrlum africanum 
comprising administering to said mammal a polypeptide, a nucleic 
acid or a composition as described above. Where the polypeptide 
is a pharmaceutical or veterinary coitpositionr the polypeptide 
5 may be administered directly- Alternatively, a nucleic acid 

encoding the polypeptides is administered to a mammal in a form 
in which it is expressed in situ. 

According to another aspect of the present invention, there is 
10 provided a peptide derived from an RD2 or RD14 region of the 

MycolDacteTium -tuberculosis r Mycohacterlum JdovIs or Mycohacterlum 
afxlcanum genomes, or a variant thereof, or a fragment of any of 
these, which peptide produces a protective immune response 
against tuberculosis infection in a mammal to which it is 
1 5 administered - 

Also encompassed by the . present invention are peptides derived 
from the pol2^eptides of the present invention. Such peptides 
are preferably synthetic peptides. 

20 

According to yet another aspect of the present invention, there 
is provided the use of a polypeptide or peptide according to the 
present invention in the preparation of a vaccine. 

25 The polypeptide may for example, have the sequence shown in SEQ 
ID NO. 7 or an epitopic fragment thereof. 

The vaccine is' preferably used to vaccinate against 
tuberculosis - It may be used as a vaccine against tiiberculosis 
30 in both humans and cattle. It is, however, preferably used as a 
vaccine in cattle. 

Preferably, the vaccine coirprises protein subunits. 
Alternatively, it may comprise DNA or cDNA encoding for the 
35 subunits. 
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According to a further aspect of the present invention, there is 
provided a method of protecting a mainmal against infection by 
Mycobacterium Jbovis, WycoJbacteri um tuberculosis or Kycoi^acteriuon 
africanuin comprising administering to said mammal a polypeptide, 
5 peptide or pharmaceutical or veterinary composition according to 
the present invention which produces an immune response against 
Mycojbacterium Jbovis, Mycobacterium tuberculosis or J^ycoijactexiuin 

SLfTXcaumm. 

LO cattle models of M. bovis infection and BCG vaccination were 
studied to identify highly immunogenic antigens from three 
genomic regions deleted in BCG Pasteur (RDl, KD2, RD14) [Behr, 
1999 supra., Mahairas, 1996 supra] that could be useful as 
specific diagnostic reagents or subunit vaccine candidates. 

15 Five hundred and thirty six overlapping synthetic peptides 

derived from the sequence of 13 antigens (open reading frames) 
encoded in these regions were synthesised and used to diagnose 
infected or vaccinated cattle. The previously mentioned ESAT- 
6/CFP-lO peptide cocktail was also included as a gold standard 

20 with which to coir5>are and all tests performed used the BOVIGAM 
ELISA for the detection of bovine IFN-y- 

The present invention will now be described only by way of 
example, in which reference shall be made to the Figures, in 
25 which: 

Figure 1 shows the recognition of RDl products by a M. bovis 
infected cow (A, C and E) and a PPD-A reactor (B, D and F) . 
Whole blood was cultiared in the presence of peptide pools of 
30 between 8-11 peptides representing RDl (A and B) , RD2 (C and D) 
and RD14 (E and F) at 5ng each peptide/ml. Dashed horizontal 
lines indicate positive cut-off (OD450 values with antigens minus 
OD450 without antigens > 0.1); 

35 Figure 2 shows IFN-y responses induced by RD region antigens by 
M. bovis infected (22), BCG vaccinated (6) and PP-A reactor 
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cattle (10) - Only the results of the pool/antigen stimulating 
the greatest IFN-y response are shown. Green squares represent 
hovis infected cattle, red triangles represent PP~A reactors 
and blue circles represent BCG vaccinated cattle- Dashed 
5 horizontal line indicate the positive cut Oof f (OD450 values with 
antigens minus OD450 without antigens >0 . 1 ) . 

Figure 3 shows IFN-y secretion induced by individual peptides 
from pool 3 (A) and pool 26 (B) in whole blood cultures from two 

10 representative animals- Whole blood was collected from M. hovis 
experimentally infected cattle and incubated for 48hrs with 
peptides . (25ug/ial each) . Results are expressed as delta mean 
optical density OD450 values with antigens ininus OD450 without 
antigens) of duplicate determinations, with a positive cut-off 

15 of 0-1. 

Figure 4 shows the antigens selected for evaluation. 
Figure 5 shows the most frequently recognised antigen. 

20 

Figure 6 shows the sequence homology between peptide 3.2 from 
Rv3873 (shown as SEQ ID NO- 7) with other mycobacterial 
proteins - 

25 Figure 7 shows the amino acid sequences which correspond to the 
open reading frames Rvl979c, Rvl769cr Rvl986, Rv3872, Rv3878, 
RV1983, Rv3873 and Rv3879c which are shown as SEQ ID Nos 1 to 6, 
the amino acid sequences of the antigens which are particular 
vaccine candidates and in particular the Rv 1979 and Rvl7 69 

30 antigens which are shown as SEQ ID Nos 14 and 15 respectively. 

Figure 8 shows the nucleotide sequences of the Rvl979c, Rvl769Cr 
RV1986, RV3872, Rv3878, Rvl983, Rv3873 and Rv3879c antigens 
whose coding sequences are shown as SEQ ID Nos 8 to 13, and 
35 whose genomic sequences are illustrated as SEQ ID Nos 61-66 
respectively. 
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• Figure 9 shows a diagnostic cocktail derived from SEQ ID NO 5 . 
Figure 10 shows a diagnostic cocktail derived from SEQ ID NO 3. 

5 

Figure 11 shows a diagnostic cocktail derived from SEQ ID NO 6. 

Figure 12 shows the coding nucleotide sequences of the Rv 197 9 
and 17 69 antigens which are shown as SEQ ID Nos 59 and 60, 
10 respectively, together with the genomic sequences 70 and 71 
respectively . 

Figure 13 shows the nucleotide coding and aitiino acid sequences 
of a novel vaccine as SEQ ID NOs 17 and 18 respectively. 



15 



20 



Figure 14 shows the first half of a fusion insert from the ORF 
of Rv 1979c (SEQ ID NO 15) and the position \fith the ORF of the 
segment to be fused in the vaccine (bold) . 

Figure 15 shows the second half of a fusion insert from the ORF 
of RV 17 69 (SEQ ID- NO 16) and the position with the ORF of the 
segment to be fused in the vaccine (bold) . 

Example 1 

25 Diagnostic tests 

The following results demonstrate that six antigens showed 
promise as diagnostic antigens with regard to their specificity, 
and that two more could be considered as potential vaccine 
candidates because they were highly iramvinogenic in all groups 

30 assayed. 

MATERIiU^S AND METHODS 

Cattae. Ca. 6 months old calves (Friesian or Friesian crosses) 
35 were obtained from herds free of bovine t\jberculosis . 
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The following groups of cattle were used in this study: 

M. 2>av±s ±n£ectxan. Nine calves were infected with a GB M. 
hovis field strain from (AF 2122/97) by intratracheal 
5 instillation of 2 x 10^ CFU as described [Buddie, B., et al, 

1995. Vaccine 13: 1123-30; Buddie, B., et al. 1995. Res- Vet. 
Sci. 59: 10-6; Rhodes SG, et al. 2000. Infect. Immun 68:2573- 
2578] - Twelve calves were infected with an M. hovis field 
strain, isolated from a New Zealand infected cow using also 

10 intratracheal instillation (5 x 10^ CFU) . Bovine tuberculosis 
was confirmed in these animals by the presence of visible 
lessions in lymph nodes and lungs found at post-mortem 
examinations, by the histo-pathological examination of lesioned 
tissues and the culture of M. bovis from tissue samples 

15 collected from lymph nodes and lungs. Heparinised blood sanples 
were obtained between 14-20 weeks after infection when strong 
and sustained in vitro tuberculin responses were observed- Data 
from a total of 21 experimentally infected cattle are presented 
in this study. One naturally infected animal was also used 

20 included in this group. 

BCG jraacinatxan^ Calves were vaccinated with BCG Pasteur by 
subcutaneous injection of 10^ CFU into the side of the neck 
followed 8 weeks later by a booster injection using the same 
25 route and dose [Buddie, 1995 supra.; Vordermeier, H. M. , et al. 
1999. Clin. Diagn. Lab. Immunol. 6:675-682]. Heparinised blood 
samples were taken between 4-6 weeks after the booster 
vaccination. Data from -6 calves will be presented in this 
study. 

30 

UxxlnfGcizGd controls, Heparinised blood from tiiberculin skin 
test-negative calves from herds free of BTB (10 animals) was 
also obtained. These animals produced IFN-y in vitro after 
stimulation with tuberculin from M. avium indicating that they 
35 have been exposed to environmental mycobacteria. 
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Antigens and peptides 

Antigens: Bovine (PPD-B) and avian (PPD-A) tuberculins were 
obtained from the Tuberculin Production Units at the Veterinary 
Laboratories Agency-Weybridge and used in culture at 10 ng/ml. 

Peptides: A set of five hundred and fifty two synthetic 
peptides spanning 13 open reading frames (20 residues long with 
a 12 residue overlap) was prepared by Multi-rod peptide 
synthesis. These were used in mapping experiments in pools of 
10 peptides at 5pig each peptide/ml and 25^g/ml when used 
individually. The peptides were purchased from Chiron Mimotopes 
(Clayton, Australia) . ESAT-6 and CFP-10 derived peptides were 
synthesised by solid phase peptide synthesis and formulated into 
a peptide cocktail as described earlier [Vordermeier , H. M. et 
al. 2001. Clin. Diagn. Lab. Immunol. 8:571-8]. They were also 
used at 5(ig each peptide/ml. Peptide purity and sequence 
fidelity of ESAT-6 and CFP-10 derived peptides was confirmed by 
analytical reverse-phase HPLC and by electron-spray mass 
spectrometry, respectively- 

Xntsr£ex:on-gamnaSLXSA. Whole blood cultures were performed in 
96-well plates in 0.2ml/well aliquots by mixing 0.1 ml of 
heparinised blood with an equal volume of antigen containing 
solution [Vordermeier, 1999 supra.]. Supernatants were 
harvested after 48 h of culture at 37°C/5% CO^ in a humidified 
incubator. Interferon-gamma (IFN-y) concentration was 
determined using BOVIGAM- ELISA kit (Biocore AH, Omaha, NE) . 
Results were deemed positive when the OD450 [PPD-B] minus OD450 
values with antigens minus OD.50 value without antigens were > 
0.1. For conparative analysis of PPD-B vs. PPD-A responses, a 
positive result was defined by an OD450 [PPD-A >0.1, and OD450 
[PPD-B] minus OD450 [unstimulated] >0.1. 



35 The DNA sequence of M. tvberaulosis H37Rv was visualised using 

either the ARTEMIS display tool [Rutherford, K. , J- et al. 2000. 
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Artemis: Sequence visualisation and annotation. Bioinf ojnnatics 
BIO: 944 - 5] or the Tuber cuList database 

(http://genolist.pasteur.fr/TubercTaIiist/) BLAST searches were 
performed from within TubercuList, or using the NCBI BLAST 

5 server 

( http: //www.ncbi/nlm.nih.qov/BLAST ) 

RESULTS 

Selection of candidate antigens from the RDl, RD2, and RD14 

10 regions of Jbovds 

Thirteen ORFs from the RDl, RD2 and RD14 regions of H. i?ovis 
were selected for screening. These regions are deleted in BCG 
Pasteur and proteins encoded within these regions hold promise 
as candidate antigens for the differential diagnosis of M. l?ovis 

15 infected animals from BCG vaccinated cattle and as potential 

vaccine candidates- Selection criteria were that the ORF should 
encode a protein that either (i) showed no, or minimal, sequence 
similarity to other proteins in M. tuberculosis or other 
organisms, (ii) belonged to the PE or PPE protein family, (iii) 

20 had the potential of being induced or upregulated in vivo (e.g. 
amino acid transporters) , or (iv) had the potential to be 
secreted. The designations of the antigens encoded by the 
selected ORF (Rv number) , their sizes, and putative functions 
are listed in Figure 4. 

25 

Immxinogenicity of selected antigens in M. hovls infected, BCG 
vaccinated and environmentally sensitised cattle 

Five hundred and thirty six overlapping peptides derived from 
the sequences of these antigens were synthesised. Peptides were 

30 then formulated into, pools of approximately 10 neighbouring 
overlapping peptides, which resulted in 52 peptide pools. 
Figure 4 indicates the pool in relation to the antigens they 
represent as well as the total number of peptides/antigen 
required to ensure complete sequence coverage. Blood samples 

35 Were obtained from 22 M. hovis infected animals, 6 m. bovis BCG 
Pasteur vaccinated animals and 10 un-vaccinated/un-inf ected 
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controls; Whole blood cultures in the presence of PPD-B, PPD-A, 
peptide pools and a cocktail of 10 synthetic peptides derived 
from ESAT-6 and CFP-10, were established and the amount of IFN-y 
determined after 48 h of culture. 

As expected, all M, bovis infected and BCG vaccinated animals 
responded more strongly to bovine tuberculin PPD-B than to avian 
tiaberculin PPD-A (median responses and range: M. bovis infected: 
PPD-B=1. 593 (0.274-3. 500), PPD-A=1 . 313 ( 0. 066-3 . 455 )) ; BCG 
vaccinated: PPD-B=0 . 886 (0 . 181-2 . 244 ) , PPD-A=0 . 5115 (0 . 274- 
2-234)); Uninfected, non-vaccinated control against animals 
responded strongly to avian PPD (PPD-A) indicating that they 
were sensitised by environmental mycobacteria (Median responses 
and ranges: PPDB=0 . 230 (0 . 090-0 . 684) , PPD-A=0 . 686 (0 . 162-1 . 822) ) ; 
they will be described hereinafter as FPD-A reactors. Next the 
immunogenicity of the peptide pools described in Figure 4 was 
assessed. Figure 1 depicts the results obtained with blood from 
two representative animals, one infected with M. bovis, the 
other a PPD-A reactor. The M. bovis infected animal recognised 
at least one peptide pool from each antigen (Figure lA, C, E) , 
indicating that cellular responses were induced after M. bovis 
infection against all 13 antigens selected. In contrast, none 
of the peptide pools induced IFN-y secretion in whole blood from 
the environmentally sensitised PPD-A reactor (Figure IB, D F) . 

The peptide-induced IFN-y responses of all 38 M. bovis infected, 
BCG vaccinated and PPD-A-reactors (uninfected controls) to the 
13 antigens are sxaramarised in Figure 2. When antigens were 
covered by more than one peptide pool, the result of the pool 
stimulating the most IFN-y secretion is shown. Interestingly, 
all 13 antigens were recognised by M. bovis infected cattle all 
be it with the percentage of responding cattle (responder 
frequencies) varying between 21 and 86 %. The most frequently 
recognised antigens were Rv3873, Rv3879c and Rvl769, with 
responder frequencies of 82, 77 and 86% respectively, whereas 
Rvl984c and Rvl772 were recognised only by 21 and 36% of 
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infected calves. Interestingly, several of the most prominently 
recognised antigens were members of the PE/PPE protein family 
(e.g., Rv3873, with ^ responder frequency of 82%). 

5 Surprisingly, considering the absence of the genes encoding 
these antigens in BCG Pasteur, 9 of the 13 antigens tested, 
stimulated a positive response in BCG vaccinated animals 
(RV3873, Rv3879c, Rvl979c, Rvl983, Rvl987, Rvl989c, Rvl768, 
RV17 69 and RV 1772, with a range in responder frequencies of 17 

10 - 100%). The remaining four antigens were recognised by M. 

hovls infected cattle only (Rv3872, Ry3878, Rvl984c, and Rvl986, 
with a range of responder frequencies of 21-59%) . The responder 
frequencies of the 8 most immunogenic antigens are summarised in 
Figure 5. In addition, 21/22 M. Jbovis infected animals 

15 responded to a previously characterised peptide cocktail derived 
from CFP-10 and ESAT-6 (reference) that had been included for 
comparison (median responses and range: 1.281 (0,011-2.825)). 

■ EXZy^PLE 2 

20 The combination of antigens offers inproved. sensitivity 

It is unlikely that a single diagnostic antigen, however 
specific, could impart enough sensitivity to provide population 
coverage; therefore. combinations of specific antigens will be 
needed. It was therefore determined whether such antigen 

25 coitibinations could improve test sensitivity. Two scenarios were 
considered: firstly,, antigens suitable for differential 
diagnosis, i.e./ not recognised by BCG vaccinated animals or 
PPD-A reactors. The three antigens most frequently recognised 
by M. hovis infected animals fulfilling this criteria are 

30 Rvl986, RV3872, and Rv3878 (Figure 5). Combining their results 
indicated that 82% of the infected animals would have been 
correctly identified by their responses to either of these three 
antigens ( Figure 5 ) . 

35 Secondly, we considered the three most immunodominant antigens 
(Rvl983, Rv3873, Rv3879c) that were not recognised by PPD-A 



BNSDOCID: <WO 



.03093307A2_I_> 



wo 03/093307 PCT/GB03/01815 

31 

reactors, but were recognised by BOG vaccinated calves, i.e., 
antigens capable of distinguishing between J^. Jbovis infection 
and animals sensitised by environmental mycobacteria for 
example, M. avium (specific diagnosis) . Taken together, these 
5 antigens would have identified 20/22 (91%) of the M. hovls 

infected animals (Figute 5). Interestingly, if Rv3878 from the 
first category was considered together with Rv3873 and Rv3879c 
from this category, 21/22 (95%) of the hovis infected animals 
would have been detected (Figure 5) • 

10 

EXAMPLE 3 

Responses of peptide pools can be the result: of a single peptide 

The peptide pools formulated contain between 8-11 peptides (see 
Figure 4 for details of peptide pools) . To determine whether 

15 IFN-y responses of pools were due to single or multiple peptide 
constituents, the individual peptides of pool 3 (representing 
residues 89-188 from Rv3873) and pool 2 6 (representing residues 
161-252 from Rvl983) were tested using blood from 5 M. jbovis 
infected animals . All three animals tested that recognised pool 

20 3 responded exclusively to peptide 3.2 (residues 97-116 - SEQ ID 
NO 7), whereas both animals tested that responded to pool 26 
only recognised peptide 26.2 (residues 169-188) . The results 
shown in Figure 3 give results from one representative animal 
responding to pools 3 (Fig- 3A) or 26 (Fig-3B), respectively. 

25 These data suggest that the individual peptides imparting 

antigenicity can be identified from immunodominant pools and 
that pool immunogenicity can be attributed to single peptides. 

The effective use of comparative genomics in combination with 
30 synthetic peptides to identify and screen thirteen potential 
antigens encoded by ORFs located in the RDl, RD2, and RD14 
regions of the H. t.ubBrculos±s has been demonstrated. These 
results indicated that six antigens in particular showed promise 
as diagnostic antigens because they were either (i) recognised 
35 by J^. hovds infected animals alone, but not by BCG vaccinated or 
controls (differential diagnosis. Figure 5) or (ii) by infected 
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animals and vaccinated aniinals but not by environmental 
mycobacteria exposed controls (specific diagnosis. Figure 5). 

In general, all 13 antigens tested were recognised with 
5 responder frequencies varying between 21 and 86%. It is likely 
that a combination of several factors determines whether and to 
what degree mycobacterial proteins are immunogenic after 
infection. These factors could include (a) parameters intrinsic 
to the bacterium, such as the abundance of the p2:otein, its sub- 

10 cellular location, post-translational modification, 

participation in macromolecular complexes, and in vivo 
regulation; and (b) factors relating to the immune system, 
including location of the antigen with respect to the phagosome, 
proteolytic sensitivity, and the presence of motifs suitable for 

15 interaction with TAP transporters and different MHC alleles 
within the antigen. 

The present invention exploits the use of pools of overlapping 
synthetic peptides derived from the sequences of these proteins. 

20 In a pilot experiment where the peripheral monocyte blood cell 
(PBMC) was isolated from 8 cattle experimentally infected with 
M. jbovis and stimulated them with either recombinant ESAT-6 or a 
cocktail of 11 synthetic peptides spanning the whole sequence of 
ESAT-6, it was concluded that the numbers of IFN-y producing 

25 cells, determined in this case by ELISPOT, demonstrated 

equivalent responses to recombinant protein and synthetic 
peptides (r=0-92, p<0.0001). The number of peptide pools that 
represent the sequences of each ORF varies depending on the size 
of the antigen, as illustrated in Figure 4. It was demonstrated 

30 that the combined results from Rv3873, Rv3878 and Rv3879c ' 
resulted in an overall responder frequency of 95% • These 3 
antigens are represented by a total of 16 different peptide 
pools, containing 169 individual peptides. However, the same 
frequency of recognition can be obtained using just 3 pools out 

35 of the 16 pools assayed (pools 3, 8 and 9), i.e., 30 peptides, 
suggesting the presence of the immunodominant epitomes within 
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these three pools. Indeed, the nuiriber of peptides needed to 
achieve responder frequencies similar to that with the cortplete 
set of overlapping peptides could even be significantly lower 
since the data described in Figure 3 demonstrates that only one 
or two immunodominant peptides can be responsible for the 
immunogenicity of the whole pool. If these peptides were to be 
recognised promiscuously in the context of multiple MHC 
molecules, as has been described in the recognition of other 
mycobacterial antigens by human, murine and bovine CD4+ T cells 
[Vordermeier, H. M., et al. 2000. Clin. Infect. Dis. 30:S291- 
. S298; vordermeier, 2001 supra.; Vordermeier, H. M. 1995. Eur. 
Respir. J. Suppl. 20:657s-667s; Lightbody, K. A., et al 1998. 
Scand. J. Immunol. 48: 44-51; Lightbody, K. A., et al. 1998. 
immunology 93:314-22; Pollock, J. M. , et al.l994. Immunology. 
.5 82:9-15; Pollock, J. M., et al. 1995. Scand. J. Immunol. 41:85- 
93.], the nxamber of peptides required to achieve wide population 
coverage could be relatively low as has been demonstrated before 
for ESAT-6 and CFP-10 derived peptides [Vordermeier, 2001 
supra., Lalvani, A., et al. 2001. J. Infect. Dis. 183:469-477; 
Lalvani, A., et al. 2001. . Am. J. Respir. Crit. Care. Med. 
163: 824-8] . 



20 



Peptides as iramuno-diagnostic reagents can therefore constitute 
a practical alternative to recombinant proteins, in addition to 
25 sxabstituting them as reagents to assess immxinogenicity. The 

fact that all three animals tested, two from the UK and one from 
New Zealand, recognised the same peptide within pool 3 (peptide 
3.2) is encouraging in this context. 

30 interestingly, the previously described peptide cocktail 

-containing peptides derived from ESAT-6 and CFP-10 was also 
recognised by 95% of the M, bovls infected anijnals tested, in 
fact the same animals that responded to the combination of 
RV3873, RV3878 and Rv3879c. 

35 
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As described in Figure 6, 4 PPD/ PE genes were selected for 
testing (Rv3872, Rv3873, Rvl983 and Rvl768) and gave responder 
frequencies of between 45-82% when assayed in the H. hovls 
infected cattle. Little is known about the function or 
iinmunogenicity of these proteins, which account for 
approximately 10% of the total coding capacity of the 
tuherculosis genome. 

As described in Figure 3, peptide 3.2 is a highly immunogenic 
component of pool 3 derived from the sequence of Rv3873, a 
member of the PPE family of proteins. The pool consistently 
produces positive responses when assayed in M. Jbovis infected 
cattle with a responder frequency of 82% but was also recognised 
in BCG vaccinated animals. This is a surprising outcome given 
that its gene is deleted in BCG and that no homologous proteins 
were found elsewhere in the BCG genome- However, the unit of 
cross-reactivity is the epitope, less than 20 amino acids long, 
that is recognised by T cells in the context of MHC molecules. 
Consequently/ the molecular nature of cross-reactivity can. only 
be addressed once these epitopes have been identified. 

Therefore we used the sequence of peptide 3.2 (shown as SEQ ID 
NO. 9) to search for similar regions with other genes found 
within the M. tiibejrculosis genome. 

Figure 6 shows the results using the Basic Local Alignment 
Search Tool (BliAST) program [NCBI. Basic Local Alignment Search 
Tool (BLAST) . http : //www/ncbi . nlm, nih. gov/BLAST/ ] to identify 
similarity between mycobacterial proteins . The table shown in 
Figure 6 highlights several sequences that contain amino acid 
identities of greater than 50%. These include five proteins 
from the M. txxb&rculosls genome, all of which are also members 
of the PPE family and several others identified in proteins of 
various mycobacterial species. The peptide covers an area of 
the gene that encodes two motifs identified in a number of PPE 
family members during their annotation [Tekaia, F., et al 1999. 
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Tuber. Lung Dis. 79:329-42- TuberciiList: . MA.ST - Motif Alignment 
and Search Tool http : / /qenolist . pas tuer , f r/Tuberculiist/ 3 . 



This suggests that the cross reactive nature of the peptide is a 
5 result of similarity with other PPE family members located 
elsewhere in the genome of M. tulDerculosis and therefore the 
genome of i>ov±s BCG Pasteur- We conducted BIiAST searches for 
the other identified cross-reactive antigens (e.g./ Rvl979c) by 
corqparing the whole genes in steps of 20 amino acids, 
10 representing the corresponding peptides, and were able to find 
numerous similar axaino acid sequences in other mycobacterial 
proteins outside the deleted regions. 

The use of peptides instead of recombinant proteins, has several 
15 advantages already discussed. However, with regard to the 

observed cross reactivity of antigens between BCG vaccinated and 
M. hovls infected animals, this peptide-based approach has other 
distinct advantages. If ORF Rvl987 is taken as an example, it 
appears unsuitable as a differential diagnostic reagent due to 
20 the high cross reactivity in the BCG vaccinated cattle. 

However, the responder frequency of 57% in JdovIs infected 
cattle is due to the recognition of two pools with responder 
frequencies of 47% and 53% respectively- Whilst one pool is 
recognised by 50% of the BCG vaccinated animals, the other is 
25 not recognised. Therefore, the diagnostic potentials of this 
antigen can still be realised by using only peptides derived 
from the second peptide pool- 

In siammary, therefore, the analysis of peptides, derived from 
30 genes deleted in BCG Pasteur, has led to the identification of 
antigens for diagnosis and even vaccination. In particular, 
antigens that can form the basis of diagnostic reagents to 
either differentiate between infected and BCG vaccinated animals 
or to improve the specificity of PPD per se are described. In 
35 addition, it has also been demonstrated for the first time that 
mepnbers of the both the PE and ppE families of proteins induced 
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cellular immune response after mycobacterial infection of a 
target species . 

EXAMPLE 4 

Xnmcunogenxc antigens 

The Experiment described in Example 1 above could also be used 
to demonstrate that two antigens could be considered as 
potential vaccine candidates because they were highly 
immunogenic in all groups assayed* 

RESULTS 

The iinmunogenicity of the peptide pools described in Figure 4 
was assessed. 2 antigens, Rvl979c and Rvl769 (of SEQ ID NO 14 
and SEQ ID NO 15 respectively) showed responder frequencies of 
73% and 8 6% respectively in the 22 cattle experimentally 
infected with M. hovis fFigxare 5;. These two antigens were also 
strongly recognised by BCG vaccinated cattle with responder 
frequencies of 67 and 100% respectively. Uniquely however, they 
also showed 40 and 30% responder frequency in the PPD-A. 

These results indicated that two antigens can be considered 
vaccine candidates since they were recognised by T cells from 
all 3 categories (Rvl979c and Rvl769) . 

EXAMPLE 5 

DNA Fusion Vaccine 

A DNA vaccine comprising a fusion of two internal gene sequences 
was constructed in the vector pvmcLINK (Vordermeier et al. 
Vaccine 2000 Dec 8;19 (9-10) :1246-55) . 

The two gene sequences, derived from a section of the sequence 
of Rvl979c and Rvl769 respectively, were generated by polymerase 
chain reaction (PGR) and ligated together via their restriction 
enzyme (RE) digested termini. The two sections encode for 
polypeptides that stimulate the production of gamma interferon 
in blood from cattle inoculated with mycobacterium bovis. 
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mycobacterium bovis BCG and others exposed to environmental 
mycobacteria (Cockle et al. Infect Iinnmun 2002 Dec;70 (12) : 6996- 
7003). The vector itself was then cut using RE's at a specific 
position and the fused insert sequence ligated within it. 

5 

The complete DNA sequence of the DHA fusion vaccine is shown in 
Figure 13. The sequence highlighted in bold is that of the DNA 
fusion insert comprised of two ORF sections from Rvl7 69 and 
10 Rvl979c respectively. The rest of the sequence is that of the 
vaccine vector pvmcLINK. 

The section of the open reading frame (ORF) from Rvl979c that 
has been fused into the DNA vaccine is shown in Figure 14. The 

15 section is an internal gene sequence that encodes an area of the 
gene that, when assayed in the translated form as polypeptides, 
stimulates .the production of gamma interferon in blood from 
cattle inoculated with mycobacterium bovis, mycobacterium bovis 
BCG and others exposed to environmental mycobacteria (Cockle et 

20 al. supra-) - The position of the antigenic gene section within 
the ORF is highlighted in the sequence in bold. 

Figure 15 shows the section of the open reading' frame (ORF) from 
Rvl7 69 that has been fused into the DNA vaccine. The section is 

25 a gene sequence that encodes an area of the gene that, when 

assayed in the translated form as polypeptides, stimulates the 
production of gamma interferon in blood from cattle inoculated 
with mycobacterium bovis, mycobacterium bovis BCG and others 
exposed to environmental mycobacteria (Cockle et al. supra) . The 

30 position of the antigenic gene section within the ORF is 
highlighted in the sequence in bold- 

All references mentioned in the above specification are herein 
incorporated by reference. Other modifications of the present 
35 invention will be apparent to those skilled in the art without 
departing from the scope and spirit of the inventioA. Although 
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the invention has been described in connection with the specific 
preferred embodiments, it should be understood that the 
invention as claimed should not be unduly liraited to such 
specific embodiments. Indeed, various modifications of the 
described modes for carrying out the invention, which are 
obvious to those skilled in the art, are intended to be within 
the scope of the following claiicis. 



10 
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CLAIMS 

1 . A diagnostic reagent comprising a peptide comprising an 
epitope from at least one polypeptide selected from Rvl98 6 (SEQ 

5 ID NO 1), RV3878 (SEQ ID NO 3), Rv 1983 (SEQ ID NO 4), RV3873 
(SEQ ID NO 5) or Rv3879 (SEQ ID NO 6). 

2 . A diagnostic reagent according to claim 1 which comprises a 
peptide comprising a series of consecutive amino acids from 

10 within the polypeptide sequences defined in claim 1. 

3. A diagnostic reagent according to claim 1 or claim 2 which 
comprises an epitope from SEQ ID Nos 3, 5 or 6. 

15 4. A diagnostic reagent according to claim 3 which comprises a 
peptide which include an epitope from SEQ ID NO 23 as. shown in 
Figure 9, or a fragment thereof. 

5 . A diagnostic reagent according to claim 4 wherein the 
20 fragment is selected from SEQ ID NOs 7, 25, 28 and 29. 

6. A diagnostic reagent according to claim 3 which comprises 
SEQ ID NO 23, SEQ ID NO 1, SEQ ID NO 25, SEQ ID NO 28 or SEQ ID 
NO 29. 

25 

7 . A diagnostic reagent according to claim 3 which comprises 
an epitope from SEQ ID NO 35, shown in Figure 10. 

8 . A diagnostic reagent according to claim 7 which comprises 
30 SEQ ID NO 35 or a fragment thereof. 

9 . A diagnostic reagent according to claim 3 which comprises 
an epitope from SEQ ID NO 48, shown in Figure 11 hereinafter, or 
a fragment thereof . 

35 
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10. A diagnostic reagent according to claim 9 wherein the 
fragment is of SEQ ID NO 51. 

11. A diagnostic reagent according to claim 9 which comprises 
SEQ ID NO 48 or variant or fragment thereof, 

5 

12. A diagnostic reagent according to claim 11 which comprises 
SEQ ID NO 51. 

13. A diagnostic kit comprising at least two diagnostic 
10 reagents, at least one of which is a diagnostic reagent 

according to any one of claims 1 to 12 . 

14- A diagnostic kit according to claim 13 which further 
comprises one or more polypeptides or peptides derived from 
15 ESAT-e and/ or the CFP-10 polypeptides. 

15. A diagnostic kit according to claim 13 wherein the 
diagnostic reagents are selected so that they are able to 
differentiate between MycohactBrium hov^ls^ Mycohacterium 

2 0 tuberculosis or Myaohact&rium africanum -infected mammals and 
mammals vaccinated* against J>^ycol>acterdum JdovIs^ Mycohacterium 
tuberculosis or Mycobacterium africanum^ 

16. A nucleic acid which encodes a diagnostic reagent according 
25 to any one of claims 1 to 12. 

17. A method for diagnosing infection in a host, or exposure of 
a host, to a mycobacterium, said method comprising 

i) contacting a population of cells from the host with a 

30 diagnostic reagent according to any one of claims 1 to 12; and 

ii) determining whether the cells of said cell population 
recognise the diagnostic reagent. 

4 

18 . A method according to claim 17 wherein the population of 
35 cells is a population of T-cells. 
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19. A polypeptide conprising any one of SEQ ID NO 1, 2, 3, 4, 5 
or 6, or variants thereof, or fragments of any of these, which 
produce a protective iinmxine response in a mammal to whom they 
are administered, for use as a medicament. 

20. A polypeptide derived from an RD2 or RD14 region of the 
Mycobacterium tuberculosis, Mycobacterium bovis or Mycobacterium 
africanum genomes, or a variant thereof, or a fragment of any of 
these, for use as a medicament, with the proviso that the 
polypeptide is not a MPT- 64 polypeptide or a polypeptide encoded 
by the Rvl984c region of the Kycoi^acterimn tuberculosis, 
Mycobacterium bovis or j^ycojbacterium africanum genomes. 

21. A polypeptide according to claim 19 or claim 20 which is 
derived from the Mycobacterium tuberculosis genome. 

22. A polypeptide according to claim 20 which comprises the 
sequence shown in SEQ ID Nos 14 or 15, or a variant thereof or 
fragment thereof. 

23. A polypeptide* according to claim 22 which is of SEQ ID NO 
14 or 15 or an epi topic fragment thereof. 

24. A polypeptide according to claim 20 which comptises a 
fusion of a region of SEQ ID NO 14 and a region of SEQ ID NO 15, 
which fusion polypeptide is able to produce a protective immune 
response in a mammal to which it is administered. 

25. A polypeptide according to claim 24 which comprises SEQ ID 
NOs 18, 20 and 22, or a protective variant or and fragment 
thereof. 

26. A vaccine comprising a polypeptide according to any one of 
claims 19 to 25. 
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27. A vaccine according to claim 26 comprising one or more 
protein subunits . 

28- A nucleic acid which encodes a polypeptide according to any 
5 one of claims 19 to 27 for use as a vaccine. 

29. A nucleic acid according to claim 28 which comprises SEQ ID 
.Nos 59 or 60, or a variant or fragment thereof. 

10 30- A nucleic acid according to claim 28 which comprises SEQ ID 
NO 17. 

31. A pharmaceutical or veterinary composition comprising a 
protective polypeptide as described above, or a nucleic acid 

15 which encodes this, in combination with a pharmaceutically or 
veterinarily acceptable carrier. 

32 . A method of protecting a mammal against infection by 
J^ycoJbacterium hovds, Mycoha.ctLBrivim tuberculosis or Mycoha.c1zerlum 

20 a-fricemuin. comprising administering to said mammal a polypeptide 
according to any ope of claims 19 to 25, a nucleic acid 
according to any one of claims 2 8 to 30 or a composition 
according to claim 31. 

25 33. A method of protecting a mammal against infection by 

Mycohacterium. hovls^ Mycohacterlum tuhGrculosis or Mycoha.ctGrlum 
africanum comprising administering to said mammal a polypeptide, 
peptide or pharmaceutical or veterinary composition according to 
the present invention which produces an immune response against 

30 J^y coJba eteri ujn jbovis, Mycohacterlum tuberculosis or Mycobacterium 
atrlcanum. 
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Figure 3 
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Figure 4. RD antigens selected for evaluation in this study 



Deleted 
Region 



Designation^ 



Size 
(Amino 
Acids) 



Peptide 
Pools^ 



Putative Function^ 



Rv3872 



Rv3873 



99 



1 

(10) 



Member of PE-like protein family 



368 



RDl 



2-5 
(40) 



Member of M tuberculosis PPE 
family - 



Rv387 



280 



6-8 

(30) 



Unknown, alanine-rich protein 



Rv3879c 



729 



RD2 



RD14 



Rv 1979c 



Rvl983 



Rvl984c 



Rvl986 



Rvl987 



Rvl98?c 



Rvl768 



Rvl769 



Rvl772 



481 



9-17 

(90) 
18-23 

(60) 



Unknown, alanine-prolme-rich 

rotein 

Possible amino acid permease 




558 



217 



24-30 

(70) 
31-33 

(30) 



Member of the PE-PGRS sub-family 
of glycine-rich proteins 
Probable secreted cvttinase 



199 



34-35 

(20) 



Possible lysine transporter 



142 



36-37 
(20) 



Possible chitinase 



186 



618 



38-39 
(20) 
40-46 
(70) 



Unknown 

Member of the PE-PGRS sub-family 
of glycine-rich proteins 



414 



47-51 
J50L 



Similar to Strepton^ces coelicolor 
hypothetical protein 



103 



51-52 
(20) 



Unknown 



"Rv designation of ORP as defined [Cole, 1998 1998. Nature 393: 537-44] 
*^umber of peptide pools required to cover ftill sequence (total number of peptides 
required shovkOi in brackets) 

Mutative function as si^gested [Cole, 1998 supra.] 
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Figure 5. List of most frequently recognised antigens' 



Designation 



Rvl986 
Rv3872 
Rv3878 



Combined 

Rvl983 
Rv3873 
Rv3879c 
Combined 

Rvl979c 
Rvl769 



^M.hovis 
Reactors 

41 

50 
59 



Responder Frequency % 

^h/Lavium 



IBCG 
Vaccinated 

"o 

0 
0 



82 

59 
82 
77 

91 

73 
86 



0 

33 
17 
33 

50 

67 
100 



Reactors 

"o 

0 
0 

"o 

0 
0 
0 
0 

40 
'30 



Potential Application 



Differential Diagnostics 



Specific Diagnostics 



Vaccines 



''Only antigens recognised by >40% of Mbovis infected animals are listed 

Results from 22 cattle experimentally infected with M6ovw 

"Results from 5-6 BCG vaccinated cattle 

'^Results from 10 environmental mycobacteria sensitised cattle 
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Figure 6. Sequence homology between peptide 3.2 from Rv3873 with other 
mycobacterial proteins 



Designation^ Putative Function Amino Acid Sequence 



SEQID 
NO 



RV3873 



Rv3021c 



Rv0286 



Rv3018c 



RvO280 



M.tuberculosis PPE AMATTPSLPEIAANHIT 



family 

MJubercuIosis PPE 
family 

M tuberculosis PPE 
family 

Mtuberculosis PPE 
family 

Adi tuberculosis PPE 
family 



ALAEMPTLPELAANHLT 



67 



ALAAMPTLAELAANHVI 68 



ALAEMPTLPELAANHLT 69 



AVAAMPTLVELAANHTL « 70 



The homology search was performed using the BLAST program. ^Designation of M 
tuberculosis proteins as described [Cole, 1998 supra.]. ^The sequence inM 
tuberculosis and A£ bovis was found to be identical. Amino acid residues are shown 



in the one letter code. Non-identical residues are underlined. 
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Figure 7 



>Rvl983: 558 aa - M. tuberculosis - SEQ. I.D NO. 4 



1 




VSFLVWPEF 


61 




EYQAVSAQAS 


121 




LIGDGAPGTA 


181 




PGGAGGNGGW 


241 




DRGTGGAGGR 


301 




AGLWSPEDV 


361 




VLLSIPTSPF 


421 




GDLNQGVLID 


481 




GTIPSYVIGS 


541 




YIDYSPSGIG 


>Rvl986: 199 aa 


1 




VNSPLWGFL 



APAC^ELVFGP NFJ-iir'iUr'JNViLV vooxtxaa^jav i^^*^w>-* • 

GTLPANTNIE VYTSPGGDRL YAFNTNDYRP TVISSGIMNT GFLPFRFQPV 
TTVFDHPA 



ifi : SSS iSSi^s^ «x^;«kT :^.o«^o ^^^.<.s«^^ 

181 - LDGLIAVMbiV ALGISLTVT 

•^TiTrriQno* qq - M tuberculosis - SEQ. I.D. NO. 2 

T-^mlnillA ^IG?QVSDN ALHGVTAGST ALTSVTGLVP AGADEVSAQA ATAFTSEGIQ 
61 - LLASNASAQD QLHRAGEAVQ DVARTYSQID DGAAGVFAE 

^DTr-3QT5. "^fiH - M tub<»rculosis - SEQ. I.D. NCX 5. 

I !^w^pIel n^^^ag papmiaaaag wqtlsaalda Qaveltarln slgeawtggg 

(^^ - SDKAiAAATP MWWLQTAST QAKTKM4QAT AQAAAYTQAM ATTPSLPEIA ANH^QAyLT 
n t^ ^SSSSSti PIALTEMDYF IBMWNQAALA MEVYQAETAV NTLFEKLEPM ASILDPGASQ 
]ll ' tSipSp KSiSvGQ LPPAATQTLG QLGEMSGPMQ QLTQPLQQVT SLFSQVGGTG 
111 - ggSJSI Sgllgtspl SNHPLAGGSG PSAGAGLLRA ESLPGAGGSL trtplmsqli 
301 - EK?v5?lw Sssatg GAAPVGAG^ gqgaqsggst rpglvapapl aqereedded 

361 - DWDEEDDW 

^TDxT-^QTft. 9ft 0 ~ M tuberculosis - SEQ. I.D. NO, 3 

"^f ! L:pI^vJpt glsaLakla glvfpqppap iavsgtdsw aainetmpsi eslvsdglpg 

fil VTOALTRTAS NMNAAADVYA KTDQSLGTSL SQYAFGSSGE GLAGVASVGG QPSQATQLLS 
121 - ^^V?toJ SSJeSaPR WATVPQLVQ LAPHAVQMSQ NASPIAQTIS QTAQQAAQSA 
ll^ - QgS§pS?Q SsJ^^EKPATE QAEPVHEVTN DDQGDQGDVQ PAEWAAiLRD EGAGASPGQQ. 
241 - PGGGVPAQAM DTGAGARPAA SPLAAPVDPS TPAPSTTTTL 

>>OTT'?jnc»r-- 779 aa - M. tuberculosis - SEQ. I.D. NO. 6 

>RV3879C 729 aa ^ vEADEDTFYD RAQEYSQVIQ RVTDVIDTCR QQKGHVFEGG 

^^ ~ SsgSSSS SSnSq LMTLQDYLAT VITWHRHIAG LIEQAKSDIG NNVDGAQREI 
itl : DILeSSS SSSSi LVTATHGftNV SLVAETAERV LESKNWKPPK NALEDLLQQK 
^Ri ^^SSSvSl vvpSgtpgt PGTPITPGTP ITPGTPITPI PGAPVTPITP TPGTPVTPvTT 

2 : ISpv?pvS ™?pgSt pxtpvtppva patpatpatp vt^^hpq p^I^ 

tni pnPVTPATPG PSGPATPGTP GGEPAPHVKP AALAEQPGVP GQHAGGGTQS GPAEIADESAA 

361 - s??pSS^ pSraaaaap sgtavgagar ssvgtaaasg agshaatgra pvatsdkaaa 
It] S^JisSr Sparppstd hidkpdrses addgtpvsmi pvsaaraard aataaasarq 
tl^ Irl^^^ rri^nas dnnagdygff witavttdgs iwansygla yipdgmelpn 

~ SS?tSSSS S^^^Srta TYPVLAVQAW i^FHDMTLRA VIGTAEQLAS SDPGVAKIVL 

eol - S^dSSS SSlJSw dpsaaaq£ad ttdqrlldll ppj^vdvnpp gder^wfe 

66^ - GREAAHLRAF RAYAAHSQEI ALHQAHTATD AAVQRVAVAD WLYWQYVTGL 

721 - LDRALAAAC 

^PTTiqiqr-- 481 aa " M. tuberculosis - SEQ I.D. NO. 14 ^™^,„t 
T ? ?gPR*RGyS HKLGFCSWM LGINSIIGAG IFLTPGEVIG LAGPFAPMAY VLAGIFAGW 

. S™SSy vrtngasyay ttaafgrrig iwgvthait asiawgvlas ^-s^l^k^S 
. ^dSaSae qlfsvktltf lgfigvxlai nlfgnraikw angtstvgka falsafivgg 

TMTTTTOHVN NYATAWSAYS ATPYSLLGVA EIGKGTFSSM AIATIVALYA FTGFESIANA 

: SSdI?^ S^Sif svgaiylltl tvamllgsnk iaasddtvkl aaaignatfr 

mfSnvaasf gaprlwtala dsgvlptrls rknqydvpmv sfaitaslal 



61 
121 

181 
241 
301 
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361 - AFPLALRFDN LHLTGLAVIA RFVQFIIVPI ALIALARSQA VEHAAVRRNA FTDKVLPLVA 
421 ~ IWSVGLAVS YDYRCIFLVR GGPNYFSIAL IVITFWVPA MAYLHYYRII RRVGDRPSTR 

>Rvl769: 414 aa - M. t\iberculosis - SEQ. I.D. NO. 15 

1 - VHEVAAREQR SDGPMRLDAQ GRLQRYEEAF ADYDAPFAFV DLDAMWGNAD QLLARAGDKP 
61 - IRVASKSLRC RPLQREILDA SERFDGLLTF TLTETLWLAG QGFSNLLLAY PPTDRAALRA 
121 - LGELTAKDPD GAPIVMVDSV EHLDLIERTT DKPVRLCLDF DAGYWRAGGR IKIGSKRSPL 
181 - HTPEQARAIA VEIARRPAiT LAALMCYEAH lAGLGDNVAG KRVHNAIIRR MQRMSFEELR 
241 - ERRARAVELV REVADIKIVN AGGTGDLQLV AQEPLITEAT AGSGFYAPTL FDSYSTFTLQ 
301 - PnaMTJ-ALPVC PJ?PGAKTVTA. LGGGYLASG^/ GAKDPJyTPTPY LPVGLB3.MAL EGTGEVQTPL 
361 - SGDAARRLKL GDKVYFRHTK AGELCERFDH LHLVRGAEW DTVPTYRGEG RTFL 
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Figure 8 

>Rvl983- 1674 bp - K. tuberculosis - SEQ. I.D. NO 64 

ggtca 

gcgagttcggcggctagtcggtctacctcagggtctttg 
atattcagcgccacaggtagatggtaccagcaaatagcc 
actatctacctaacgcgtgctgtgccgtgcggtagctac 
tgaaaatccgagatgtcaaaggcagcgtctggatacgct 
gtatgcgcgcagggatggtgatcgaggcggaggggcggc 

1 - gtg tea ttt ctg gtc gtg gtt ccc gag ttc 
31 - ttg acg tec gcg gca gcg gat gtg gag aac 
61 - ata ggt tec aca ctg cgc gcg gcg aat gee 
91 - gcg get gee gee teg ace ace gcg ctt gcg 
121 - gee get ggc get gat gag gta teg gcg gcg 
151 - gtg gca gcg ctg ttt gee agg ttc ggt cag 
181 - gaa tat caa gcg gtc age gcg cag gcg age 
211 - get ttc eat eaa cag ttc gtg cag acg ctg 
241 - aac teg gcg tea gga teg tat gcg gee gcg 
271 - gag gee acc ate gcg tea cag ttg cag aec 
301 - geg cag cac gat ctg ctg ggc gcg gtc aat 
331 - gca eea ace gaa acg ttg ttg ggg cgt ecg 

3 61 - eta ate ggc gac gga gca eee ggg acg gca 
391 - acg agt ecg aat ggc ggg gcg ggt ggg ctg 
421 - ctg tac gge aac ggc ggc aac ggt tat tec 

4 51 - gcg acg gcg teg ggg gtc ggc gge ggg gee 
481 - gge ggt tec gcg ggg ttg ate ggc aat- gge 
511 - gge gee ggg gga gee ggc gga ccc aac gee 
541 - eee gag gga gee gge gge aac ggt ggc tgg 
571 - ctg etc ggc aac ggc ggg ate gge ggg ccc 
601 - ggg gge gcg teg age ate ccc ggc atg agt 
631 - ggt gga gee gge gga ace ggc ggt gee gca 
661 - gga ctt ttg ggc tgg gga geg aac gge gga 
691 - gee ggc ggc etc 'ggt gat gga gtc ggt gtc 
721 - gat cgt ggc acg gge ggc gee gga ggc cgc 
7 51 - gge ggc ctg ttg tat gge gga tac ggc gtc 
781 - agt ggg eca ggc ggc gac gge aga acc gtc 
811 - ecg ctg gag ata att cat gtc aca gag ecg 
841 - acg gta cat gee aac gtc aac ggc gga ecg 
871 - acg tea ace att ctg gtc gac acc gga tec 
901 - get ggt ctt gtt gtc teg cet gag gat gtc 
931 - ggg gga ate ctg gga gtg ctt cac atg ggc 
961 - etc cea ace gga ttg age ate age ggt tac 
991 - age ggg ggg ctg tac tac ate ttc gee acg 
1021- tat acc acg acg gtg gac ttc ggg aat ggc 
1051- ate gtc acc gcg ecg acc gee gtt aat gtc 
1081- gtc etc ttg tec ate cea acg tee ccc ttc 
1111- gee att teg acc tac ttc age gee ttg ctg 
1141- gee gat ecg aca aca act ecg ttc gaa gee 
1171- tat ttc ggt gee gtc ggc gtg gac ggc gtt 
1201- ctg gga gtt ggg ccc aat gcg gtg gga cea 
1231- ggc ccc age att ecg acg atg gcg tta ecg 

•1261- ggt gac etc aac cag gga gtg etc ate gac 
1291- gca ccc gca ggt gag etc gtg- ttc ggt ccc 
1321- aac ecg eta cet gcg ccc aac gtc gag gtc 
1351- gtc gga teg ecg ate acc acc ctg tac gta 
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Figure 8 (Cont'd) 

1381- aag ate gat ggt ggg act ccc ata ccc gtc 
1411- CCC teg ate ate gat tec ggt ggg gta acg 
1441- gga acc ate ccg tea tat gtc ate gga tec 
1471- gga acc ctg ccg gcg aac aca aac att gag 
1501- gtc tac acc age ccc ggc ggt gat egg etc 
1531- tac gcg ttc aac aca aac gat tac ege ccg 
1561- acc gtc att tea tec ggc ctg atg aat acc 
1591- ggg ttc ttg ccc ttc aga ttc cag ccg gtg 
1621- tac ate gac tac age ccc age ggt ata ggg 
1651- aca aca gtc ttt gat cat ccg gcg 

tgatcgagcctgttegccgcgaatgtcgccgcctggctt 

gtcatcccegactgaacatacgaaacatgegecataata 
ttgccgectccggtgcatattggategtcgggagcacac 
aagtttatggtcttagagctatacagcggaccgattgtc 
ggeaacgacccgecgeeceacaacatgctggagaaacca 

ctgga 

>Rvl983: 1674 bp - M. tuberculosis - SEQ. I.D. NO. 11 

gtgtcatttctggtcgtggtteecgagttettgacgtcegcggcagcggatgtggagaac 

ataggttccacactgcgcgcggcgaatgcegeggetgccgcetegaccaccgegettgcg 

gccgctggegetgatgaggtatcggcggcggtggcagcgctgtttgccaggttcggtcag 

gaatatcaageggtcagcgcgeaggegagegctttecatcaacagttcgtgeagaegctg 

aactcggcgtcaggatcgtatgcggeegeggaggccaccategcgtcaeagttgcagaec 

gcgcagcacgatctgctgggcgcggtcaatgcaccaaccgaaacgttgttggggcgtceg 

ctaatcggcgacggagcaeccgggacggeaacgagtcegaatggcggggcgggtgggctg 

ctgtacggcaacggcggcaacggttattecgcgacggegtcgggggteggcggeggggec 

ggcggttccgcggggttgatcggeaatggcggcgccgggggagecggcggaeecaacgec 

cccgggggageeggcggcaacggtggctggctgctcggcaacggcgggateggcgggccc 

gggggcgcgtcgagcatccceggcatgagtggtggageeggeggaaccggeggtgccgca 

ggacttttgggctggggagcgaacggcggagecggcggcetcggtgatggagteggtgtc 

gategtggcacgggcggcgccggaggccgcggcggcctgttgtatggcggataeggcgtc 

agtgggccaggeggcgacggcagaacegtcccgctggagataattcatgtcaeagagccg 

aeggtacatgceaacgtcaacggcggaeegaegtcaaeeattctggtegaeaecggatcc 

gctggtcttgttgtctcgcctgaggatgtcgggggaateetgggagtgettcaeatgggc 

eteeeaaceggattgagcatcagcggttacagcggggggctgtactacatcttcgccacg 

tataccacgaeggtggacttcgggaatggcategtcacegcgcegaccgccgttaatgtc 

gtcctcttgtccatcccaacgtcececttcgecatttcgacctacttcagcgcettgctg 

gccgatccgacaacaactccgttcgaagcctatttcggtgccgtcggcgtggacggcgtt 

etgggagttgggeccaatgcggtgggaceaggccccagcatteegacgatggegttaccg 

ggtgacctcaaccagggagtgctcategacgcaccegeaggtgagetegtgtteggtccc 

aacccgctacctgcgcccaacgtcgaggtcgtcggatcgccgatcaccaceetgtacgta 

aagategatggtgggactcccataeeegtcccctcgatcatcgattccggtggggtaacg 

ggaaccatcccgtcatatgtcatcggatccggaaceetgecggcgaaeacaaacattgag 

gtctacaccagccccggcggtgatcggctctacgcgttcaacacaaacgattaecgcccg 

accgtcatttcatccggcctgatgaataccgggttcttgcccttcagattccagccggtg 

taeategactacagccccagcggtatagggaeaacagtctttgateatecggcg 
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Figure 8 (Cont'd.) 

>Rvl986: 597 bp - M. tuberculosis - SEQ. I.D. NO. 61 

tgtag 

gcgctccgcggccgcatcgaagctgcccagttcgaccac 
ggcagccaatgcggccagctgtggaccgtcaagctgcgg 
atccaccatctcaggtgtagaccatctgcggagcgtcgc 
actgcacattaataatgctaatgtaaatgaagaattatt 
agctatactgacccatacaaactgcctagtgtcgattgc 

1 - gtg aac tea cca ctg gtc gtc ggc ttc ctg 

31 - gcc tgc ttc acg ctg ate gcc gcg att ggc 

61 - gcg cag aac gca ttc gtg ctg egg cag gga 

91 - ate cag cgt gag cac gtg ctg ccg gtg gtg 

121 - gcg ctg tgc acg gtg tec gac ate gtg ctg 

151 - ate gcc gcc ggt ate gcg ggg ttc ggc gca 

181 - ttg ate ggc gca cat ccg cgt gcg etc aat 

211 - gtc gtc aag ttt ggc ggc gcc gcc ttc eta 

241 - ate ggc tae ggg eta ctt gcg gee egg egg 

271 - gcg tgg ega cct gtt gcg ctg ate cca tct 

301 - ggc gcc acg ccg gtt cgc tta gcc gag gtc 

331 - ctg gtg aec tgt gcg gea ttc acg ttc etc 

361 - aac cca cac gtc tac etc gac ace gtc gtg 

391 - ttg eta ggc gcg ctg gee aac gag cac age 

421 - gac cag cgc tgg ctg ttc -ggc etc ggc gcg 

451 - gtc aca gcc agt gcg gta tgg ttc gcc aec 

481 - etc ggg ttc gga gcc ggc egg ttg cgc ggg 

511 - ctg ttc aec aac ccc ggc teg tgg aga ate 

541 - etc gac ggc ctg ate gcg gtc atg atg gtt 

571 - gcg ctg gga ate teg ctg ace gtg aec 

tagtacagcacgtgtgcacacgcgggttggaccacgtga 

tcgtcgatgggcacataccgttcggcaggagggcgcgcg 

gtcagtetgcacaacteagteaceagctgacaegecgac 

ggcggcctcgcecgg^egtgteggcgccaccagtgcaea 

ttcggegtgacgcggccctacggatcgtgttggagetgt 

agccc 

>Rvl986: 597 bp - M. tuberculosis - SEQ. I.D. NO. 8 
gtgaactcaccactggtcgtcggcttcctggectgetteacgctgatcgccgcgattggc 

gegcagaacgeattegtgctgcggcagggaatccagcgtgagcaegtgetgccggtggtg 

gcgctgtgeacggtgteegacatcgtgctgatcgccgecggtatcgcggggttcggcgca 

ttgatcggcgcacatccgegtgegetcaatgtcgtcaagtttggcggcgccgccttccta 

atcggctacgggctacttgcggcceggegggcgtggegacctgttgcgctgatcccatct 

ggcgccacgccggttcgcttagccgaggtcctggtgacctgtgeggcatteaegttcctc 

aacceacaegtctacctcgacaccgtcgtgttgctaggcgcgctggecaacgagcacagc 

gaccagcgctggctgttcggcetcggcgcggtcaeagccagtgcggtatggttcgccacc 

etcgggttcggagccggceggttgcgcgggctgttcaccaaccccggctcgtggagaatc 

ctegacggcctgategeggtcatgatggttgcgctgggaatctegctgaccgtgace 
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Figure 8 (Cont'd) 

>Rv3872: 297 bp - M. tuberculosis - SEQ. I.D. NO. 62 

ggccc 

cctacatcgagcctccagaagaagtgttcgcagcacccc 
caagcgccggttaagattatttcattgccggtgtagcag 
gacccgagctcagcccggtaatcgagttcgggcaatgct 
gaccatcgggtttgtttccggctataaccgaacggtttg 
tgtacgggatacaaatacagggagggaagaagtaggcaa 

1 - atg gaa aaa atg tea cat gat ccg ate get 

31 - gcc gac att ggc acg caa gtg age gae aac 

61 - get ctg cac ggc gtg acg gee ggc teg acg 

91 - gcg ctg acg teg gtg ace ggg ctg gtt cec 
121 - gcg ggg gee gat gag gtc tec gee caa gcg 
151 - gcg acg gcg tte aca teg gag ggc ate caa 
181 - ttg ctg get tee aat gca teg gcc caa gac 
211 - cag etc cae egt gcg ggc gaa gcg gtc cag 
2 41 - gac gtc gee cgc ace tat teg caa ate gae 
271 - gae ggc gee gee ggc gtc tte gcc gaa 

taggcececaacaeatcggagggagtgatcaccatgctg 

tggcaegeaatgceaccggagetaaataccgeacggctg 
atggccggcgcgggtecggetccaatgettgcggcggcc 
gegggatggcagaegcttteggcggctctggacgctcag 
gecgtcgagttgaeegcgcgectgaaetctetgggagaa 

geetg 

>Rv3872: 297 bp - M. tuberculosis - SEQ. I.D. NO. 9 

atggaaaaaatgtcacatgatccgategctgccgacattggcaegcaagtgagcgacaac 

. getetgcacggcgtgaeggceggetegaeggcgctgacgtcggtgaccgggctggttcce 
gcgggggcegatgaggteteegcccaageggcgaeggegtteaeateggagggearccaa 
ttgctggcttccaatgcatcggcceaagaceagctceaccgtgegggcgaageggteeag 
gacgtegeeegeaeetattcgeaaatcgaegacggcgecgceggcgtcttegcegaa 
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;iRv3873: 1104 bp - M. tuberculosis - SEQ. I.D. NO 65 

atgag 

gtctccgcccaagcggcgacggcgttcacatcggagggc 
atccaattgctggcttccaatgcatcggcccaagaccag 
ctccaccgtgcgggcgaagcggtccaggacgtcgcccgc 
acctattcgcaaatcgacgacggcgccgccggcgtcttc 
gccgaataggcccccaacacatcggagggagtgatcacc 

1 - atg ctg tgg cac gca atg cca ccg gag eta 

31 - aat acc gca egg ctg atg gcc ggc gcg ggt 

61 - ccg get cca atg ctt gcg gcg gcc gcg gga 

91 - tgg cag acg ctt teg gcg get ctg gac get 

121 - cag gcc gtc gag ttg acc gcg cgc ctg aac 

151 - tct ctg gga gaa gcc tgg act gga ggt ggc 

181 - age gac aag gcg ctt gcg get gca acg ccg 

211 - atg gtg gtc tgg eta eaa acc gcg tea aca 

■?41 - cag gcc aag acc cgt gcg atg cag gcg acg 

271 - gcg eaa gee gcg gca tac acc cag gcc atg 

301 - gcc acg acg ccg teg ctg ccg gag ate gcc 

331 - gcc aac cac ate ace cag gee gtc ctt aeg 

361 - gee ace aac ttc ttc ggt ate aac acg ate 

391 - ccg ate gcg ttg acc gag atg gat tat ttc 

4^1 - ate cgt atg tgg aac cag gca gcc ctg gca 

451 - atg gag gtc tac cag gcc gag acc gcg gtt 

481 - aac acg ctt ttc gag aag etc gag ccg atg 

511 - gcg teg ate ctt gat cec ggc gcg age cag 

541 - age acg acg aac ccg ate ttc gga atg cec 

571 - tee cct ggc age tea aca ccg gtt ggc cag 

601 - ttg ccg ccg gcg get acc cag acc etc ggc 

631 - caa ctg ggt gag atg age ggc ccg atg cag 

661 - cag etq acc cag ccg ctg cag cag gtg acg 

691 - teg ttg ttc age 'cag gtg ggc ggc acc ggc 

721 - ggc ggc aac cca gcc gac gag gaa gcc gcg 

751 - cag atg ggc ctg etc ggc ace agt ccg ctg 

781 - teg aac cat ccg ctg get ggt gga tea ggc 

811 - cec age gcg ggc gcg ggc ctg ctg cgc gcg 

841 - gag teg eta cct ggc gca ggt ggg teg ttg 

871 - acc cgc acg ccg ctg atg tct cag ctg ate 

901 - gaa aag eeq gtt gcc cec teg gtg atg ccg 

931 - gcg get get gcc gga teg teg gcg acg ggt 

961 - ggc gcc get ccg gtg ggt gcg gga gcg atg 

991 - ggc cag ggt gcg caa tee ggc ggc tec acc 

1021- agg ccg ggt ctg gtc gcg ccg gca ccg etc 

1051- gcg cag gag cgt gaa gaa gac gac gag gac 

1081- gac tgg gac gaa gag gac gac tgg 

tgagcteecgtaatgacaacagacttcceggccacccgg 

accggaagacttgeeaaeattttggcgaggaaggtaaag 

' --' cegatg 
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ccgetaeectcgcgcaggaggcaggtaatttcgagegga 
tctceggegaectgaaaaceeagatcgaccaggtggagt 



cgacg 
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Figure 8 (Conf d) 

>Rv3873: 1104 bp - M. tuberculosis - SEQ. I.D. NO 12 

atgctgtggcacgcaatgccaccggagctaaataccgcacggctgatggccggcgcgggt 
ccggctccaatgcttgcggcggccgcgggatggcagacgctttcggcggctctggacgct 
caggccgtcqagttgaccgcgcgcctgaactctctgggagaagcctggactggaggtggc 
agcgacaaggcgcttgcggctgcaacgccgatggtggtctggctacaaaccgcgtcaaca 
caggccaagacccgtgcgatgcaggcgacggcgcaagccgcggcatacacccaggccatg 
gccacgacgccgtcgctgccqgagatcgccgccaaccacatcacccaggccgtccttacg 
gccaccaacttcttcggtatcaacacgatcccgatcgcgttgaccgagatggattatttc 
atccgtatgtggaaccaggcagccctggcaatggaggtctaccaggccgagaccgcggtt 
aacacgcttttcgagaagctcgagccgatggcgtcgatccttgatcccggcgcgagccag 
agcacgacgaacccgatcttcggaatgccctcccctggcagctcaacaccggttggccag 
ttgccgccggcggctacccagaccctcggccaactgggtgagatgagcggcccgatgcag 
cagctgacccagccgctgcagcaggtgacgtcgttgttcagccaggtgggcggcaccggc 




ggcggcaacccagccgaci. _ ^ 
tcgaaccatccgctggctgatggatcaggccccagcgcgggcgcgggcctgctgcgcgcg 

gagtcgctacctgacgcaggtgggtcgttgacccgcacgccgctgatgtctcagcrgatc 

gaaaagccggttgccccctcggtgatgccggcggctgctgccggatcgtcggcgacgggt 

ggcgccgctccggtgggtgcgggagcgatgggccagggtgcgcaatccggcggctccacc 

aggccgggtctggtcgcgccggcaccgctcgcgcaggagcgtgaagaagacgacgaggac 

gactgggacgaagaggacgactgg 
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Figure 8 (Cont'd) 



>Rv3878: 840 bp - M. tuberculosis - ^EQ-^I-D^ NO. 63 

tggatcaccggggtgtacgacacggtccgcaatatccgg 
t?Stgagccggatcggctgattggcggttcctgacagaa 
caSaagaScggcgcaggtttgcataccttcggcgcc 
cgacaaattactgcgattgagcgtgtggcgcgtccggta 
aaatttgctcgatggggaacacgtataggagatccggca 

1 - atg get gaa ccg ttg gcc gtc gat ccc acc 
31 - gg? ?tg age gca gcg gcc gcg aaa ttg gcc 
6^ - ggc a.tl git ttt .eg cag cr.t cog gcg ccg 
91 - a?c gcg gtc age gga aeg gat teg gtg gta 
121 - gca gca ate aac gag acc atg eca age ate 
lil - gaa ?eg etg gtc agt gac ggg ctg ccc ggc 
181 - gtg aaa gcc gee etg act ega aca gca tec 
211 - aac atg aac gcg gcg gcg gac gtc tat gcg 
241 - aag ace gat cag tea ctg gga acc agt. ttg 
571 - age cag tat gca tte ggc teg teg ggc gaa 
301 - ggc ctg get ggc gtc gcc teg gtc ggt ggt 
331 - cag eca agt cag get acc cag ctg ctg age 
361 - aca ccc gtq tea cag gtc aeg acc cag etc 
391 - ggc gag aeg gee get gag ctg gca ccc cgt 
421 - gtt gtt gcg aeg gtg ccg caa etc gtr cag 
451 - ctg get ccg cac gcc gtt cag atg teg caa 
48^ - III gca tec ccc ate get cag aeg ate agt 
511 - caa acc acc caa cag gcc gcc cag age gcg 
541 - cag ggc ggc age ggc eca atg ccc gca cag 
^71 - ct? g?e age get gaa aaa ccg gcc ace gag 
601 - caa gcg gag ccg gtc cac gaa gtg aca aac 
631 - gac gat cag ggc gac cag ggc gac gtg cag 
661 - ccg gee gag gtc gtt gcc gcg gca cgt gac 
691 - gaa ggc gee ggc 'gca tea ccg ggc cag cag 
721 - ccc ggc ggg ggc gtt ccc gcg caa gee atg 
751 - gat acc gga gcc ggt gcc cgc eca gcg gcg 
781 - Igt ccg ctg gcg gcc ccc gtc gat ccg teg 
811 - act ccg gca ccc tea aca acc aca aeg ttg 

tagacegggcctgccagcggctccgtctcgcacgcagcg 
cctgttgetgtcetggcctegtcageatgcggcggccag 




ccagtccgcgacggccacacgctggacggcegcgtcagt 
cgcagtgtgcgcttggtgcagggcaatctcctgtgagtg 



ggcag 



•^o,.-:ift7n- fl40 bo - M tuberculosis - SEQ. I-D. NO. 10 

gtgaaagcegecetgactcgaacagcatccaacatgaacgcgguyy yy y 
Sagaccgatcagteactgggaaccagtttgagccagtatgcattcggctcgtcggg 
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Figure 8 (Cont'd) 

cagggcggcagcggcccaatgcccgcacagcttgccagcgctgaaaaaccggccaccgag 
caagcggagccggtccacgaagtgacaaacgacgatcagggcgaccagggcgacgtgcag 
ccggccgaggtcgttgccgcggcacgtgacgaaggcgccggcgcatcaccgggccagcag 
cccggcgggggcgttcccgcgcaagccatggataccggagccggtgcccgcccagcggcg 
agtccgctggcggcccccqtcgatccgtcgactccggcaccctcaacaaccacaacgttg 
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>Rv-H879c: 2187 bp - tuberculosis - SEQ. I D. NO. 66 

cccgt 

gcgacagcgcggccagcgcaagcgaggtgaccacccggc 
tgatcgcccaagcggtgcatcatgcgcgcggattcaacg 
qSttactgcgaataccggcgcggggtggatccagcggcc 
gagccggcgtgaaatgccggaggccaaccggacggtgat 
ccgcgaggcgatctggcggtttggggagggcagtagggg 
1 - atg agt att acc agg ccg acg ggc age tat 
31 - gcc aga cag atg ctg gat ccg ggc ggc tgg 
61 - gtg gaa gcc gat gaa gac act ttc tat gac 
91 - egg gcc cag gaa tat age cag gtt ttg caa 
121 - agg gtc acc gat gta ttg gac acc tgc cgc 
l51 - cTg cag aaa ggc cac gtc ttc gaa ggc ggc 
181 - eta tgg tec ggc ggc gee gcc aat get gcc 
211 - aac ggc gcc etg ggt gea aac ate aat caa 
741 - ttg atg acg ctg cag gat tat etc gee acg 
2-71 - gtg att ace tgg cac agg cat att gee ggg 
301 - ttg att gag caa get aaa tec gat ate ggc 
331 - aat aat gtg gat ggc get caa egg gag ate 
361 - gat ate etg gag aat gae ect age ctg gat 
391 - get gat gag cgc cat acc gcc ate aat tea 
421 - ttg gtc acg gcg acg cat ggg gee aar gre 
451 - agt ctg gtc gcc gag acc get gag egg gtg 
481 - ctg gaa tec aag aat tgg aaa cct ccg aag 
511 - aac gea etc gag gat ttg ett cag cag aag 
541 - teg ccg cca cce eca gae gtg cct ace ctg 
571 - gtc gtg cca tec ccg ggc aca ccg ggc aca 
601 - ?eg gga acc ccg ate ace ccg gga acc ccg 
631 - ate acc ccg gga acc cca ate aca cce arc 
661 - eeg gga gcg ccg gta act ccg ate aca cca 
691 - acg cce ggc act cce gtc acg ccg gtg aee 
721 - ccg ggc aag ccg gtc acc ccg gtg acc ccg 
7 51 - gtc aaa ccg ggc aca cca ggc gag cca acc 
781 - ccg ate acg ccg gtc acc cce ccg gtc gcc 
811 - ccg gee aca ccg gea ace ccg gcc acg cce 
841 - gtt acc cca get cce get cca cac ccg cag 
871 - leg get ccg gea ccg gcg cca teg cct ggg 
901 - cec cag ccg gtt aca ccg gee act cce ggt 
931 - ccg tct ggt cca gea aca ccg ggc ace cca 
961 - ggg ggc gag ccg gcg ccg cac gre aaa cec 
99^ - gig gcg ttg gcg gag caa ect ggt gtg ccg 
1021- ggc cag cat gcg ggc ggg ggg acg cag teg 
1051- ggg cct gee cat gcg gac gaa tec gee gcg 
1081- teg gtg acg eeg get gcg gcg tec ggt gtc 
1111- ccg ggc gea egg gcg gcg gee gee gcg ccg 
1141- age ggt aee gee gtg gga gcg ggc gcg cgt 
1171- teg age gtg agt acg gee gcg gee teg ggc 
1201- gcg ggg teg cat get gcc act ggg egg gcg 
1231- ccg gtg get acc teg gac aag gcg gcg gea 
1261- ccg age acg egg gcg gcc teg geg egg acg 
1291- gea ect cct gcc ege ccg ccg teg acc gat 
1321- cae ate gac aaa cec gat cgc age gag .tct 
1351- gea gat gae ggt acg ccg gtg teg atg ate 
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Figure 8 (Conf d) 

1381- ccg gtg teg gcg get egg gcg gca cgc gac 
1411- gcc gcc act gca get gcc age gee cgc cag 
14 41- cgt ggc cgc ggt gat gcg ctg egg ttg gcg 
1471- cga egc^ ate gcg gcg gcg etc aac gcg tec 
1501- gac aac* aac gcg ggc gac tac ggg ttc tte 
1531- tgg ate ace gcg gtg acc ace gac ggt tec 
1561- ate gtc gtg gee aac age tat ggg ctg gcc 
1591- tac ata cec gac ggg atg gaa ttg ccg aat 
1621- aag gtg tac ttg gcc age gcg gat cac gca 
1651- ate ccg gtt gac gaa att gca cgc tgt gcc 
1681- ace tac ccg gtt ttg gcc gtg caa gee tgg 
1711- gcg get tte cac gac atg aeg ctg egg gcg 

17 41- gtg ate ggt acc gcg gag cag ttg gcc agt • 
1771- teg gat cec ggt gtg gcc aag att gtg ctg 
1801- gag eca gat gac att ccg gag age ggc aaa 
1831- atg acg ggc egg teg egg ctg gag gtc gtc 

18 61- gac cec teg gcg gcg get cag ctg gcc gac 
1891- act ace gat cag cgt ttg etc gac ttg ttg 
1921- ccg ccg gcg ccg gtg gat gtc aat eca ccg 
1951- ggc gat gag egg cac atg ctg tgg ttc gag 
1981- ctg atg aag cec atg ace age acc get ace 
2011- ggc cgc gag gcc get eat ctg egg gcg ttc 
2041- egg gee tac get gcc cac tea cag gag att 
2071- gee ctg cac caa gcg cac act gcg act gac 
2101- gcg gee gtc cag cgt gtg gee gtc gcg gac 
2131- tgg ctg tac tgg caa tac gtc acc ggg ttg 
2161- etc gac egg gcc ctg gcc gcc gca tgc 

tgacgaggccaggaeageaaeaggcgctgcgtgcgagac 

ggagccgctggcaggcccggtctacaaegttgtggttgt 
tgagggtgccggagtcgacggatcgacgggggccgccag 
cggactcgccgetgggcgggcaccggctccggtatccat 
ggcttgcgcgggaaegeecccgecgggctgetggecegg 

tgatg 

>Rv3879c: 2187 bp - M. tuberculosis - SEQ. I.D. NO. 13" 

atgagtattaccaggccgacgggcagctatgecagacagatgctggatccgggeggctgg 

gtggaagcegatgaagacactttctatgaccgggcccaggaatatagceaggttttgcaa 

agggtcaccgatgtattggacacctgccgecageagaaaggccacgtcttcgaaggcggc 

ctatggtcegacggcgccgecaatgctgeeaaeggegccctgggtgcaaacateaatcaa 

ttgatgacgctgcaggattatctcgccacggtgattacetggeacaggeatattgeeggg 

ttgattgageaagctaaatccgatatcggcaataatgtggatggcgctcaacgggagarc 

gatatcctggagaatgaecetagectggatgctgatgagcgccatacegccatcaattca 

ttggtcacggcgaegcatggggccaatgtcagtctggtegecgagacegctgagegggtg 

ctggaatccaagaattggaaacctccgaagaacgcactcgaggatttgcttcageagaag 

tegccgceaceeceagacgtgcetaccctggtcgtgecatcceegggcacaccgggcaca 

ccgggaaccccgatcaccccgggaaececgateaceecgggaaececaatcaeacccate 

ccgggagegccggtaactccgatcacaeeaacgeccggcactcecgtcacgceggiigacc 

ccgggeaagceggteaccecggtgaccecggtcaaaccgggcacaccaggcgagccaacc 

ccgateacgceggtcaeecccceggtcgccecggecaeaceggcaaecceggccacgccc 

gttaceeeagctcecgctccacaccegcagccggcteeggcaceggcgeeatcgcctggg. 

ceeeagccggttaeaecggecaetcecggtccgtctggtccagcaacaccgggcacecca 

gggggcgagceggcgecgcacgtcaaaeecgcggegttggcggageaacctggtgtgccg 

ggceagcatgcgggcggggggaegcagteggggcctgcccatgcggaegaarcegecgcg 
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Figure 8 (Conf d) 

tcggtgacgccggctgcggcgtccggtgtcccgggcgcacgggcggcggccgccgcgccg 
agcggiaccgccgtgggagcgggcgcgcgttcgagcgtgggtacggccgcggcctcgggc 
g?ggggtcgLtgctgccactgggcgggcgccggtggctacctcggacaaggcggcggca 
Lgagcacgcqggcggcctcggcgcggacggcacctcctgcccgcccgccgtcgaccgat 
caLLgaL^aScJgatcgcagcgagtctgcagatgacggtacgccggtgtcgatga^^ 

ccggtgLggcggctcgggcggcacgcgacgccgccactgcagctgccagcgccc^^^^ 
• cataaccgcggtgatgcgctgcggttggcgcgacgcatcgcggcggcgctcaacgcgtcc 

ScScaIcg?g?gcga?tacgggttcttctggatcaccgcggtgaccaccgacg^^^^ 
SLgtcgtggcSLcagctatgggctggcctacatacccgacgggatggaattgccgaat 

aaaatgLStggccagcgcggatcacgcaatr.r.r.ggttgacgaaHt.tgcacgctgtg^^ 
acc^a?ccggtt?tggccgtgcaagcctgggcggctttccacgacatgacgctgcgg^^^ 
ataatcggtaccgcgaagcaattggccagttcggatcccggtgtggccaagattgtgctg 

SgSSl^SStLcggagagcggcaaaatgacgggccggtcgc^ 



ScSScggcggcggctcagctggccgacactaccgatcagcgtttgctcgacttg^^^ 



<-caccaacaccqqtggatgtcaatccaccgggcgaT:gsguyy'-«=^^'=^^y>--^-^»«-— 
c?SS?aScccatgaccagcaccgctaccggccgcgaggccgctcatctgcgggcgttc 

cSSSi?gc?gcLactLcaggagattgccctgcaccaagcgcacactgcgac^^^ 
g2?gccgtc?ag?gtgtggccgtcgcggactggctgtactggcaatacgtcaccgggttg 

ctcgaccgggccctggccgccgcatgc 
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Figure 9 



Diagnostic Cocktail 1 



SEQ ID NO 23; 

AT AQAAAYTQAM ATTPSLPEIA ANHITQAVLT ATNFFGINTI PIALTEMDYF IRMWNQAALA 
MEVYQAETAV NTLFEKLEPM ASILDPGASQ STTNPIFG 



Derived from the sequence of a segment of Rv3873 (SEQ ID NO 5), as 
highlighted in bold below - 

>M. tuberculls bacterialRv3873|PPE68: 368 aa - PRE FAMILY PROTEIN 

1 - MLWHAMPPEL NTARIMAGAG PAPMLAA?VAG WQTLSAALDA QAVELTARLN SLGEAWTGGG 

61 - SDKALAAATP MV\7WLQTAST QAKTRAMQAT AQAAAYTQAM ATTPSLPEIA ANHITQAVLT 

121 - ATNFFGINTI PIALTEMDYF IRMWNQAALA MEVYQAETAV NTLFEKLEPM ASILDPGASQ 

181 - STTNPIFGMP SPGSSTPVGQ LPPAATQTLG QLGEMSGPMQ QLTQPLQQVT SLFSQVGGTG 

241 - GGNPADEEAA QMGLLGTSPL SNHPLAGGSG PSAGAGLLRA- ESLPGAGGSL TRTPLMSQLI 

301 - EKPVAPSVMP AAAAGSSATG GAAPVGAGAM GQGAQSGGST RPGLVAPAPL AQEREEDDED 
361 - DWDEEDDW 

Cocktail comprised of 1 1 peptides, each 20 amino acids long, with an overlap of 
12 residues. 

ATAQAAAYTQAMATTPSLPE SEQ ID NO 24 

TQAMATTPSLPEIAANHITQ SEQ ID NO 2 5 

SLPEIAANHITQAVLTATNF SEQ ID NO 2 6 

HITQAVLTATNFFGINTIPI SEQ ID NO 27 

ATNFFGINTIPIALTEMDYF SEQ ID NO 28 

TIPIALTEMDYFIRMWNQAA SEQ • ID NO 29 

MDYFIRMWNQAALAMEVYQA SEQ ID NO 30 

NQAAIiAMEVYQAETAVNTLF SEQ ID NO 31 

VYQAETAVNTLFEKLEPMAS SEQ ID NO 32 

NTLFEKLEPMASILDPGASQ SEQ ID NO 33 

PMASILDPGASQSTTNPIFG SEQ ID NO 34 

Peptides highlighted in bold are of special importance, data suggesting the main 
epitope of the pool lies within. 
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Figure 1 0 



Diagnostic Coclctail 2 



Sa QGG^S^PmL LASAEKPATE QAEPVHEVTN DDQGIK3GDVQ PAEWAAAKD EGAGASPGQQ 
PGGGVPAQAM DTGAGARPAA SPLAAPVDPS TPAPSTTTTL 

Derived from the sequence of a segment of Rv3878 (SEQ ID NO 3). highlighted 
in bold below - 

>M tuberculis bacterialRv38781Rv3878: 280 aa - CONSERVED 
HYPOTHETICAL ALANINE RICH PROTEIN 

^,,,„T, r-Tin?T3nPPRP TAVSGTDSW AAINETMPSI ESLVSDGLPG 

I : si s Si Si issi 1= 

241 - PGGGVPAOAM DTGAGARPAA SPIAAPVDPS TPAPSTTTTL 

Cocktail comprised of 12 peptides, each 20 amino acids long, with an overlap of 
12 residues. 

AQSAQGGSGPMPAQLASAEK ' SEQ ID NO 3 6 

GPMPAQIASAEKPATEOAEP SEQ ID NO 37 

SAEKPATEQflEPVHEVTNDD SEQ ID NO 3b 

QAEPVHEVTNDDQGDQGDVQ SEQ ID NO 39 

TNDDQGDQGDVQPAEWAAA SEQ ID NO 40 

GDVQPAEWAAARDEGAGAS SEQ ID NO 41 

VAAARDEGftGASPGQQPGGG SEQ ID NO 42 

• AGASPGQQPGGGVPAC2AMDT SEQ ID NO 4 3 

PGGGVPAQRMDTGAGARPAA SEQ ID NO 4 4 

AMDTGAGARPAASPIAAPVD SEQ ID NO 45 

RPAASPLAAPVDPSTPAPST SEQ ID NO 46 

SPLAAPVDPSTPAPSTTTTL SEQ ID NO 47 
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Figure 11 



Diagnostic Cocktail 3 

ufSJSrGSY l^QMLDPGGW VEADEDTFYD RAQEYSQVLQ RVTDV1.DTCR QQKGHVFEGG 
LWSGGAANAA NGALGANINQ LMTLQDYLAI VI 

Derived from the sequence of a segment of Rv3879c (SEQ ID NO 6). highlighted 
in bold below - 

>M tuberculis bacteria|Rv3879c|Rv3879c: 729 aa - HYPOTHETICAL ALANINE 
AND PROLINE RICH PROTEIN 



MSITRPTGSY ARDMLDPGGW VEADEDTFYD RAQEYSQVLQ KVTDVLDTCR QQKGHVE^GG 
^cSJSS. NS^NINQ LMTLQDYIAT VITWHRHIAG LIEQAKSDIG NN\7DGAQRE1 

S^SSpSS SSS! S^Shg.^ s.v^t^kv fs^lll 

-f™ 

SJSSg SSS?Sp ggepaphvkp aalaeopgvp gqhagggtqs gpahadesaa 
eS^lAi^rS pgJSaISI sgtavgagar ssvgtaaasg agshaatgra pvatsdkaaa 
ISSJSS ISSSS?^ Sdkpdrses addgtpvsmi. pvsaaraard aataas^arq 
rSSSJS dnnagdygff witavttdgs iwansygla yipdgmelpn 

??^5™CA TYPVIAVOAW AAFHDMTLRA VIGTAEQLAS SDPGVAKIVL 
SIrSw DPSAAAQLM) TTDQRiLDLL PPAPVDWPP GDERHMLWFE 
SSpm?BS SSISSJ SLmSoEI ALHQAHTATD AAVQRVAVAD WLYV.QYVTGL 



1 

61 

121 
181 
241 
301 
361 
421 
481 
541 
601 
661 

721 - LDRALAAAC 



Cocktail comprised of 10 peptides, each 20 amino acids long, with an overlap of 
12 residues. 

MSITRPTGSYARC2MLDPGGW . SEQ ID NO 4 9 

SYARQMLDPGGWVEADEDTF SEQ ID NO 50 

PGGWVEADEDTFYDRAQEYS SEQ ID NO 51 

EDTFYDRAQEYSQVLQRVTD SEQ ID NO 52 

QEYSQVLQRVTDVLDTCRQQ SEQ ID NO 53 

RVTDVLDTCRQQKGHVFEGG SEQ ID NO 54 

CRQQKGHVFEGGLWSGGAAN SEQ ID NO 55 

FEGGLWSGGATUJIAANGAiGA SEQ ID NO 56 

GAANAANGALGANINQLMTL SEQ ID NO 57 

ALGANINQLMTLQDYLATVI SE.Q ID NO 58 

Peptide highlighted in bold is of special importance, data suggesting the main 
epitope of the pool lies within it. 
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Figure 12 

>Rvi979c: 1443 bp - rubercuiosis - SEQ. l.D. NO. 70 

cgact 

cgatgctggcctagactcgcgaggaccgcgcggtggtca 
ctgcgcggatttggggcggcggaaatgagtgttcggtgc 
gcccactgcggtgactcacctgcagcgccggcatcgaca 
ggccgggagctcaagaatcgtcgctagagaatctatggt 
gcgttagaggattccctgctagacagccttggtgcggtg 
1 - gtc ggc ccg egg acg aga gga tat gcg ate 
31 - cac aag ctg ggt ttc tgc age gtc gtc atg 
61 - etc ggg ate aac teg ata ate ggc gee ggt 
91 - ate ttc eta act cca ggt gag gtg ate ggg 
121 - etc gca gga cec ttc gcg ccg atg gcc tat 
151 - gtt tta get ggc att ttc gcg ggt gtc gtg 
181 - gcg ate gtc ttc gcg acg gcg gca agg tac 
211 - gtc aga aca aac ggt gee tec tac gee tac 
241 - aca acg gee gca ttt ggg cgc egg ate ggc 
271 - ate tat gtc ggt gtc ace cac gcc att ace 
301 - gcg tec ate get tgg ggg gtg ttg get tct 
331 - ttt ttc gtc teg acg ctg ttg cga gtg gee 
361 - ttc ccc gae aag gee tgg gcc gac gee gag 
391 - caa ctg ttc agt gtg aag acg ctg acg ttt 
421 - etc ggc ttt ate ggc gtg ctg ttg gcc ate 
451 - aac etc ttc ggc aac egg gcg ate aag tgg 
461 - gcc acic gya acy Lea acy yla yyc aay yea 
511 - ttc gcg etc teg gca ttc att gtc ggc ggg 
541 - ctg tgg ate ate ace ace cag cac gtg aac 
571 - aac tac gca acg gcg tgg teg gca tac age 
601 - gcg ace ccg tac teg ttg ett ggc gtc gee 
631 - gaa att ggc aag ggc acg ttc teg agt atg 
661 - ycy cLy ycc acy aLL yLc ycy ILy Lau yea 
691 - ttc ace ggt ttc' gaa teg ate gcg aac gcc 
721 - gcc gaa gaa atg gac gcg ccg gac egg aac 
751 - ctg ccg aga get ata ccg ate gcg ate ttc 
781 - teg gtt ggc gcg ate tac ttg etc ace eta 
811 - acg gta gcg atg ctg etc gga teg aac aag 
341 - aLc ycc ycy Ley yac gac acc'yLy aa.a cLy 
871 - gee gcg gcc ate gga aac get ace ttc cga 
901 - acg ate ate gtc gtc gga gcc ctg ata teg 
931 - atg ttc ggc ate aa-t gtc gcg gcc teg ttc 
961 - ggt gca ccg egg ctt tgg ace gcg tta gcg 
991 - gac age ggg gtt ctg ccg aca cgc ttg tea 

1021- eye aay aae caa Lae yac yLy cey aLy yLc 

1051- tec ttc gca att acg gcg teg ttg gcg etc 
1081- gca ttc ccg ttg gcg ctg egg ttc gac aac 
1111- ctg cac ctg acc ggc ctg gcg gtg ate gcc 
1141- cga ttc gtc cag ttc ate ate gtg ccg ate 
1171- get etc ate gca ttg gcg agg tct cag gca 

1201- yLa yaa caL ycL ycL yLy eyy eya daL ycy 

1231- ttc ace gac aag gtg tta ccg ett gtt gcg 
1261- ate gtg gtc teg gtt ggg ctg gca gtg tec 
1291- tac gac tac cgc tgc ate ttt eta gtg egg 
1321- ggt ggt ccg aac tac ttc teg att get ttg 
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Piguire 12 (Cont'd) 

i35i- arc grg arc acg rrc grc grg gra ccg gcg 
1381- atg get tat ctg cac tac tac cga ate att 
1411- cgc egg gtt ggc gat egg ccg age act cge 
1441- tag 

attccgttggcgctgagctcgaacgggagaacacaacgg 
cgagcgatggcgggaatagcctggtcggtgcgggcaaga 
ttteaacctgcatteeeggateggcggcgcgggcaagcg 
tctgcaacgccgagggaetgtaggcacgtagtgcgetga 
taaagccgtcgtgeatgctcgagcgcatcgacgaccatg 
gcage 



>Rvl979c: 1443 bp - M. tuberculosis - SEQ. I.D. NO. 59 

gtcggcccgcggacgagaggatatgcgatccacaagctgggtttctgcagegtcgtcatg 

ctegggatcaactcgataatcggcgccggtatcttcctaactccaggtgaggtgateggg 

ctcgeaggacccttcgcgccgatggcctatgttttagctggcattttcgcgggtgtegtg 

gcgatcgtcttcgcgacggcggcaaggtacgtcagaacaaacggtgcctectacgcctac 

acaacggcegcatttgggcgccggatcggeatctatgtcggtgteacccaegccattacc 

gcgtccatcgcttggggggtgttggcttcttttttcgtctegacgctgttgcgagtggec 

ttccccgacaaggcctgggccgacgccgagcaactgttcagtgtgaagacgctgacgttt 

ctcggctttatcggcgtgetgttggeeatcaacctcttcggcaaccgggcgatcaagtgg 

gccaacggaacgtcaacggtaggcaaggcattcgegctctcggcattcattgteggcggg 

ctgtggatcatcaccacccageacgtgaacaactacgcaaeggcgtggtcggcatacagc 

ycgaccccy tactcgttgcttggcg'tcy ccgaaatty y caaggg cacy 

gcgctggccacgattgtcgegttgtacgcattcaccggtttegaatcgategcgaacgcc 

gccgaagaaatggacgcgccggaccggaacctgecgagagctatacegatcgcgatcttc 

tcggttggcgcgatctacttgctcacectaacggtagcgatgctgctcggatcgaacaag 

atcgecgcgtcggaegaeacegtgaaactggccgcggccatcggaaacgetaccttccga 

acgatcatcgtcgteggagceetgatatcgatgttcggcatcaatgtcgcggcctcgttc 

ggty caccy eg g ctttgga'ccy Cy ttag Cy gacagcy gg g ttctgccg acacgcttg 

cgeaagaaeeaatacgacgtgccgatggtetcettegcaattaeggcgtcgttggcgetc 

geattcccgttggcgetgcggttcgacaaectgeaeetgaecggcetggeggtgatcgcc 

cgattegtecagtteateatcgtgccgatcgetctcategcattggegaggtctcaggea 

gtagaacatgctgctgtgeggcgaaatgcgttcaccgaeaaggtgttaccgcttgttgeg 

atcgtggtctcggttgggctggcagtgtcctacgaetaecgctgeatctttetagtgegg 

yy tggtccgaactacttctcy atty ctttgatcgty atcacgttegtcgtggtaccggcg 

atggcttatetgcactactaccgaateattcgccgggttggcgatcggcegageactcgc 

tag 
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Figiire 12 (Cont'd) 

>Rvl769: 1242 bp - M. iiubercuiosis - SEQ. I.D. NO. 71 

tcggg 

cgggttgctattcggccaaaatgggatgcccgggccgtg 
agcgccccaacccaggccaaccccctatgggcaatctgc 
acatcaattggccaggtcgacagcagaccgcacacatct 
acgagattggttcccgatccgtgggtggggccgggaaaa 
gcggctgtaagagttggctaggttcagtagggtggcggc 

1 - gtg cat gag gtg get get cgt gag caa cgt 
31 - teg gac ggg cog atg agg ctg gat gcg cag 
61 - ggc cga ctg cag cgt tac gag gag gcg ttc 
91 - get gae tac gat gea ccg ttt gcg ttc gta 
121 - gat etc aac gcg atg tgg ggc aat gee gat 
151 - caa ctg ctt gcg cgc gee ggc gac aag ccg 
181 - ate egg gtg gcg teg aag teg ctg cgt tgc 
211 ~ cga cca ctg caa cgc gaa ate ctt gat gee 
241 - agt gag cga ttc gac ggg eta ttg aeg ttc 
271 - aeg ctt ace gag aeg ctg tgg ctt gcc ggc 
301 - caa ggt ttc. teg aac ctg ttg ttg gcc tac 
331 - ccg ccg acc gac egg gcg gca ttg cgt gcg 
361 - ctt ggc gag ctg aeg gcc aag gac ccg gae 
391 - ggg gcg ccg ate gtg atg gtg gac age gtg 
421' - gag cac ctt gac ctg ate gag cgc aeg ace 
451 - gac aag ccg gta egg ctg tgt ctg gat ttc 
481 - gaL gcc yyc LaL Lyy cyc ycu ggc ggg egy 
511 - ata aaa att ggt tec aag cgc teg ccg ctg 
541 - cac acc ccg gag cag get cgc gca etc gcg 
571 - gtg gag ate gcg egg egg ccg gcg eta aeg 
601 - ttg gcg gcg ttg atg tgc tac gag gcc cac 
631 - att gcg ggc etc ggt gac aac gte gcc ggc 
661 - aag cgy yLc yao adC ycy aLc aLc cyL oyy 
691 - atg cag cgc atg teg ttc gaa gag ctg cgc 
721 - gag cgt cgt gcc egg gee gte gag ctg gtg 
751 - cgc gag gte gcc gac ate aag ate gte aac 
781 - gcc ggt ggc ace ggc gac ttg cag ctg gtt 
- gcg cag gag ccg ttg att acc gaa gcg acc 

.-.^ - ycu yy^ Lu;y yyL LLL Lau yuy ^uy cx^cx ^Ly 

871 - ttc gac teg tat teg aeg ttc- aeg ctg cag 
901 - cec gcg gcg atg ttc gcg ctg ccg gta tgc 
931 - cgt cgt cec ggt gea aag ace gtg acc gcg 
961 - etc ggg ggt ggc tat tta gee age ggg gte 
991 - ggg gcg aag gae cgc atg ccg act cec tac 

1021- cLg cug yLu ygy *-:Ly ctciy uLu ciciL yi^y uLy 

-L051- gag gga aeg ggc gaa gtt cag aca ccg eta 
1081- tee ggt gat gca gcc cga egg ctg aag ctt 
1111- ggc gae aag gte tac ttc cgc cac acc aag 
1141- gee ggt gag ctg tgt gag egg ttc gae cat 
1171- ctg cat ctg gte cgt ggc get gaa gta gte 

1201- yciu cLCu yLu cue dUC Ldc uyy yyL ydd yyy 

1231- cgc acc ttc etc 

taatgctgaaatggacgaggcceacecggctcaeeeggc 

agatgcggggcggcccggtggcccaattcaaggegcgcg 

aagaggagctgceatgacaecgatcaccgccctgccgac 
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Figure 12 (Cont'd) 

cgagttggcggccatgcgcgaggtagtcgagacgctcgc 
acccattgagcgtgccgcgggcgagccgggtgagcacaa 

ggcgg 

>Rvl769: 1242 bp - M. tuberculosis - SEQ. I.D. NO. 60 

gtgcatgaggtggctgctcgtgagcaacgttcggacgggccgatgaggctggatgcgcag 

ggccgactgcagcgttacgaggaggcgttcgctgactacgatgcaccgtttgcgttcgta 

gatctcgacgcgatgtggggcaatgccgatcaactgcttgcgcgcgccggcgacaagccg 

atccgggtggcgtcgaagtcgctgcgttgccgaccactgcaacgcgaaatccttgatgcc 

agtgagcgattcgacgggctattgacgttcacgcttaccgagacgctgtggcttgccggc 

caaggtttctcgaacctgttgttggcctacccgccgaccgaccgggcggcattgcgtgcg 

cttggcgagctgacggccaaggacccggacggggcgccgatcgtgatggtggacagcgtg 

gagcaccttgacctgatcgagcgcacgaccgacaagccggtacggctgtgtctggatttc 

gatgccggctattggcgcgccggcgggcggataaaaattggttccaagcgctcgccgctg 

cacaccccggagcaggctcgcgcactcgcggtggagatcgcgcggcggccggcgctaacg 

ttggcggcgttgatgtgctacgaggcccacattgcgggGctcggtgacaacgtcgccggc 

aagcgggtccacaacgcgatcatccgtcggatgcagcgcatgtcgttcgaagagctgcgc 

gagcgtcgtgcccgggccgtcgagctggtgcgcgaggtcgccgacatcaagatcgtcaac 

gccggtggcaccggcgacttgcagctggttgcgcaggagccgttgattaccgaagcgacc 

gccggctcgggtttttacgcgccgacactgttcgactcgtattcgacgttcacgctgcag 

cccgcggcgatgttcgcgctgccggtatgccgtcgtcccggtgcaaagaccgtgaccgcg 

ctcgggggtggctatttagccagcggggtcggggcgaaggaccgcatgccgactccctac 

ctgccggtcgggctgaagctcaatgcgctggagggaacgggcgaagttcagacaccgcta 

tccggtgatgcagcccgacggctgaagcttggcgacaaggtctacttccgccacaccaag 

■y C'Cyy" t: y^a y"C ty't'y t gay'Cg'yt: t CyaCCa t Cc'y'Ca t C t'y y t C* Cy u'y'y'Cy'Ct'g a a'y t: a'yt: C 

gacaccgtccccacctaccggggtgaagggcgcaccttcctc 
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Figure 13 



Complete Vaccine Sequence 



Highlighted in bold the position and sequence of Fusion Insert 

1 aatgacggta aatggcccgc ctggcattat gcccagtaca tgaccttatg 
q-r - mar lal cpv hdlm 
51 ggactttcct acttggcagt acatctacgt attagtcatc gctattacca 
gls yla vhlr ish ryy 
101 tggtgatgcg gttttggcag tacatcaatg ggcgtggata gcggtttgac 

hgda via vhq wawi av- 
151 tcacggggat ttccaagtct ccaccccatt gacgtcaatg ggagtttgtt 
Itg isks pph - rq wefv 
201 ttggcaccaa aatcaacggg actttccaaa atgtcgtaac aactccgccc 

lap kst glsk ms - qlr 
251 cattgacgca aatgggcggt aggcgtgtac ggtgggaggt ctatataagc 

pida ngr - ac tvgg lyk 
301 agagctctct ggctaactag agaacccact gcttactggc ttatcgaaat 
qss Ian - rth cll ayrn 
351 taatacgact cactataggg agacccaagc ttagacgcct ggagacgcca 

-yd si - gdps Ida wrr 
401 tccacgctgt tttgacctcc atagaagaca ccgggaccga tccagcctcc 

hprc fdl hrr hrdr ssl 
451 gcggccggga acggtgcatt ggaacgcgga ttccccgtgc caagagtgac 
rgr erci gtr ipr aksd 
501 gtaagtaccg cctatagagt ctataggccc acccccttgg cttcttatgc 
vst ay;i: vyrp tpl asy. 
551 atgctatact gtttttggct tggggtctat acacccccgc ttcctcatgt 

acyt vfg Igs ihpr flm 
601 tataggtgat ggtatagctt agcctatagg tgtgggttat tgaccattat 
1-v mv-1 si - vwv idhy 
651 tgaccactcc cctattggtg acgatacttt ccattactaa tccataacat 

-pi pyw - ryf pll ihn 
701 ggctctttgc cacaactctc tttattggct atatgccaat acactgtcct 

male hns lyw lyan tls 
751 tcagagactg acacggactc tgtattttta caggatgggg tctcatttat 
frd - hgl cif tgw gliy 
801 tatttacaaa ttcacatata caacaccacc gtccccagtg cccgcagttt 

ylq ihi yntt vps ars 
851 ttattaaaca taacgtggga tctccacgcg aatctcgggt acgtgttccg 

fy-t - rg ist risg tcs 
901 gacatgggct cttctccggt agcggcggag cttctacatc cgagccctgc 
ghg Ifsg sgg ast sepc 
951 tcccatgcct ccagcgactc atggtcgctc ggcagctcct tgctcctaac 
sha ssd swsl gss 111 
1001 agtggaggcc agacttaggc acagcacgat gcccaccacc accagtgtgc 

tvea rlr list mptt tsv 
1051 cgcacaaggc cgtggcggta gggtatgtgt ctgaaaatga gctcggggag 
phk avav gyv sen elge 
1101 cgggcttgca ccgctgacgc atttggaaga cttaaggcag cggcagaaga 
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Figure 13 (Cont'd) 

rac tad afgr Ika aae 
1151 agatgcaggc agctgagttg ttgtgttctg ataagagtca gaggtaactc 

edag s-v vvf - -es evt 
1201 ccgttgcggt gctgttaacg gtggagggca gtgtagtctg agcagtactc 
pva vllt veg svv - avl 
1251 gttgctgccg cgcgcgccac cagacataat agctgacaga ctaacagact 

vaa ara trhn s-q tnr 
1301 gttcctttcc atgggacttt tctgcagtca ccgtccaagc ttggtaccgc 

Ifls mgl fcs hrps Ivp 
1351 caccaatgtg ggccaacgga acgtcaacgg taggcaaggc attcgcgctc 
ppm wang tst vgk afal 
1401 tcggcattca ttgtcggcgg gctgtggatc atcaccaccc agcacgtgaa 

saf ivg glwi itt qhv 
1451 caactiacgca acggcgtggt cggcatacag cgcgaccccg tactcgttgc 

nnya taw say satp ysl 
1501 ttggcgtcgc cgaaattggc aagggcacgt tctcgagtat ggcgetggcc 
Igv aeig kgt fss mala 
1551 acgattgtcg cgttgtacgc attcaccggt ttcgaatcga tcgcgaacgc 

tiv aly aftg fes ian 
1601 cgccgaagaa atggacgcgc cggaccggaa cctgccgaga gctataccga 

aaee mda pdr nlpr aip 
1651 tcgcgatctt ctcggttggc gcgatctact tgctcaccct aacggtagcg 
iai fsvg aiy llt ltva 
1701 atgctgctcg gatcgaacaa gatcgccgcg tcggacgaca ccgtgaaact 

mil -gsn kiaa sdd tvk 
17 51 ggccgcggcc atcggaaacg ctaccttccg aacgatcatc gtcgtcggag 

laaa ign atf rtii vvg 
1801 ccctgatatc gatgttcggc atcaatgtcg cggcctcgtt cggtgcaccg 
ali smfg inv aas fgap 
1851 cggctttgga ccgcgttagc ggacggatcc gaattcatgc atgaggtggc 

rlw tal adgs efm hev 
1901 tgctcgtgag caacgttcgg acgggccgat gaggctggat gcgcagggcc 

aare qrs dgp mrld aqg 
1951 gactgcagcg ttacgaggag gcgttcgctg actacgatgc accgtttgcg 
rlq rye e a. fa dyd apfa 
2001 ttcgtagatc tcgacgcgat gtggggcaat gccgatcaac tgcttgcgcg 

fvd Ida mwgn adq 11a 
2051 cgccggcgac aagccgatcc gggtggcgtc gaagtcgctg cgttgccgac 

ragd kpi rv a sksl rcr 
2101 cactgcaacg cgaaatcctt gatgccagtg agcgattcga cgggctattg 
plq reil das erf dgll 
2151 acgttcacgc ttaccgagac gctgtggctt gccggccaag gtttctcgaa 

tft Ite t lwl agq gfs 
2201 cctgttgttg gcctacccgc cgaccgaccg ggcggcattg cgtgcgcttg 

nlll ayp ptd raal ral 
2251 gcgagctgac ggccaaggac ccggacgggg cgccgatcgt gatggtggac 
gel takd pdg api vmvd- 
2301 agcgtggagc accttgacct gatcgagcgc acgaccgaca agccggtacg 

sve hid lier ttd kpv 
2351 gctgtgtctg gatttcgatg ccggctattg gcgcgccggc gggcggataa 
rlcl dfd agy wrag gri 
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Figure 13 (Cont'd) 

-^401 aatgatctag agggccctat tctatagtgt cacctaaatg ctagagctcg 
k-s rgpy siv spk c-ss 
2451 ctgatcagcc tcgactgtgc cttctagttg ccagccatct gttgtttgcc 

lis Idc af-l pal ccl 
2501 cctcccccgt gccttccttg accctggaag gtgccactcc cactgtcctt 

■plpr afl dpg rchs hep 
2551 tcctaataaa atgaggaaat tgcatcgcat tgtctgagta ggtgtcattc 
fli k-gn cia Ise - vsf 
2601 tattctgggg ggtggggtgg ggcaggacag caagggggag gattgggaag 

ysg gwg gagq qgg gl^ 
2651 acaatagcag gcatgctggg gatgcggtgg gctctatggc ttctgaggcg 
rq-q acw gcg glyg f-g 
2701 gaaagaacca gctggggctc tagggggtat ccccacgcgc cctgtagcgg 
gkn qlgl - gv spr al- r 
2751 cgcattaagc gcggcgggtg tggtggttac gcgcagcgtg accgctacac 

rik rgg cggy aqr dry 
2801 ttgccagcgc cctagcgccc gctcctttcg ctttcttccc ttcctttctc 

tcqr psa rsf rflp fls 
2851 gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg gcatcccttt 
rhv rrls pss sks ghpf 
2901 agggttccga tttagtgctt tacggcacct cgaccccaaa aaacttgatt 

rvp i.-c ftap rpq kt- 
2951 agggtgatgg ttcacgtagt gggccatcgc cctgatagac ggtttttcgc 

Ig-w ft - w ai alid gfs 
3001 cctttgacgt tggagtccac gttctttaat agtggactct tgttccaaac 
pfd vgvh vl - - wt Ivpn 
3051 tggaacaaca ctcaacccta tctcggtcta ttcttttgat ttataaggga 

wnn tqp ylgl ff - f ir 
3101 ttttggggat ttcggcctat tggttaaaaa atgagctgat ttaacaaaaa 

dfgd fgl ivk k-ad Itk 
3151 tttaacgcga attaattctg tggaatgtgt gtcagttagg gtgtggaaag 
i-r elil wnv cql gcgk 
3201 tccccaggct ccccaggcag gcagaagtat gcaaagcatg catctcaatt 

spg spg rqky akh asq 
3251 agtcagcaac caggtgtgga aagtccccag gctccccagc aggcagaagt 

Ivsn qvw kvp rips rqk 
3301 atgcaaagca tgcatctcaa ttagtcagca accatagtcc cgcccctaac 
yak hasq Ivs nhs papn 
' 3351 tccgcccatc ccgcccctaa ctccgcccag ttccgcccat tctccgcccc 

sah pap nsaq frp fsa 
3401 atggctgact aatttttttt atttatgcag aggccgaggc cgcctctgcc 

pwlt nff ylc rgrg-rlc 
3451 tctgagctat tccagaagta gtgaggaggc ttttttggag gcctaggctt 
1-a ipev vrr Ifw rprl 
3501 ttgcaaaaag ctcccgggag cttgtatatc cattttcgga tctgatcaag 

Iqk apg slyi hfr i-s 
3551 agacaggatg aggatcgttt cgcatgattg aacaagatgg attgcacgca 

rdria riv shd - trw iar 
3601 ggttctccgg ccgcttgggt ggagaggcta ttcggctatg actgggcaca 
rfs grig gea irl - Igt 
3651 acagacaatc ggctgctctg atgccgccgt gttccggctg tcagcgcagg 
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Figure 13 (Cont'd) 

tdn rll - err vpa vsa 
3701 ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc cctgaatgaa 

gapg sfc qdr pvrc pe- 
37 51 ctgcaggacg aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc 
tag rgsa aiv agh dgrs 
3801 ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg gactggctgc 

Irs car rch - sgk -gla 
3851 tattgggcga agtgccgggg caggatctcc tgtcatctca ccttgctcct 

aigr sag ags pvis pes 
3901 gccgagaaag tatccatcat ggctgatgca atgcggcggc tgcatacgct 
ere sihh g-c naa aaya 
3951 tgatccggct acctgcccat tcgaccacca agcgaaacat cgcatcgagc 

-sg ylp irpp set shr 
4001 gagcacgtac tcggatggaa gccggtcttg tcgatcagga tgatctggac 

asty sdg srs c rsg - sg 
4051 gaagagcatc aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc 
rra sgar asr tvr qaqg 
4101 gcgcatgccc gacggcgagg atctcgtcgt gacccatggc gatgcctgct 
aha rrr gsrr dpw rcl 
• 4151 tgccgaatat catggtggaa aatggcegct tttctggatt catcgactgt 
lae y hgg kwp Ifwi hrl 
4201 ggccggctgg gtgtggcgga ccgctatcag gacatagcgt tggctacccg 
wpa gcgg pis ghs vgyp 
4251 tgatattgct gaagagcttg gcggcgaatg ggctgaccgc ttcctegtgc 

-yc - ra wrrm g-p Ipr 
4301 tttacggtat cgccgctccc gattcgcagc gcatcgcctt ctatcgcctt 

airy rrs rfa ahrl Isp 
4351 cttgacgagt tcttctgagc gggactetgg ggttcgaaat gaccgaccaa 
s-r vll's gtl gfe mtdq 
4401 gcgacgccca acetgccatc acgagatttc gattccaccg ccgccttcta 

atp nip srdf dst aaf 
4451 tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc tggatgatcpc 

yerl gfg ivf rdag wmi 
4501 tccagcgcgg ggatcteatg ctggagttct tcgcccaccc caacttgttt 
Iqr gdlm lef fah pnlf 
4551 attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 

iaa yng yk-s nsi tnf 
4 601 aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca 

tnka.ffs Ihs scgl ski 
4 651 tcaatgtatc ttatcatgte tgtataccgt cgacctctag ctagagcttg 
inv s'yhv cip sts s-sl 
4701 gcgtaatcat ggtcatagct gtttcctgtg tgaaattgtt atccgetcac 

a-s ws - Ifpv - no ypl 
4751 aattccacac aacatacgag ccggaagcat aaagtgtaaa gcctggggtg 

tiph nir ags ikck awg 
4801 cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct 
a- - vs-1 tli air slpa 
4851 ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg 

fqs gnl scql h- - igq 
4901 cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca 

ragr ggl rig rssa ssl 
4951 ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 
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Figure 13 (Cont'd) 



5001 
5051 
5101 
5151 
5201 
5251 
5301 
5351 
5401 
5451 
5501 
5551 
5601 
5651 
5701 
5751 
5801 
5851 
5901 
5951 
6001 
6051 
6101 
6151 
6201 
6251 



tds irsv vrl rra vsah 
tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa 

skavir iste sgdnag 
gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg 
knm - akg qq^^ arnr 
cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa 
all affii rlr ppd ehhk 
aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 

nrr ssq rwrn ptg 1-r 
ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 
yq af PP'g ssl vrsp vpt 
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 
Ipl tgyl saf Ips gsva 
ctttctcaat gctcacgctg taggtatctc agttcggtgt aggtcgttcg 

isq csr cryl ssv ' ^ J 
ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg 
rskl gcv hep pvqp 
ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta 
alsgnyr lesnpvrh d 1 
tcgccactgg cagcagccac tggtaacaga attagcagag cgaggtatgt 

spl aaa tgn.- isr ary 
aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 
V g g a tef Ikw wpny g i't 
gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga 
r rt vfgi cal Ikp vt fg 
aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg 

krv gss - sgk qtt ags 
tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc 
g g f f V c'k qqi trrk kgs 
aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa 
q e d plif stg sda qwne 
aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 

nsr - gi Ivmr Isk rif 
ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 
t-il In - k-s fksi - s^^ 
atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct 
v e - twsd syq cli seap 
atctcagcga tctgtctatt tcgttcatcc atagttgcct gactccccgt 

isa icl frss iva - 1 9 
cgtgtagata actacgatac gggagggctt accatctggc cccagtgctg 
v v-i tti reg Ipsg psa 
caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata 
ami prdp rsp apd l ^ a x 
aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttauc 
nqp agr aerr sgp ^^"^^^ 
cqcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtu 
s asi qsi ncc rear vss 
cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg 
sp. Vnslr nvv aia t gi v 
gtgtcacgct cgtcgtttgg tatggcttca ttcagctccg gttcccaacg 

vsr ssf gmas fss q 
atcaaggcga gttacatgat cccccatgtt gtgcaaaaaa gcggttagct 
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Figure 13 (Confd) 

rsrr vt - spm ickk avs 
6301 ' ccttcggtcc tccgatcgtt gtcagaagta agttggccgc agtgttatca 
sfg ppiv vrs kla avis 
6351 ctcatggtta tggcagcact gcataattct cttactgtrca tgccatccgt 

Imv maa Ihns Itv mps 
6401 aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat 

vrcf svt gey stks f-e 
6451 agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat 
-cm rrps esc pas irdn 
6501 accgcgccac atagcagaac tttaaaagtg ctcatcattg gaaaacgttc 

tap hsr tlkv lii gkr 
6551 ttcggggcga aaactctcaa ggatcttacc gctgttgaga tccagttcga 

ssgr kls ril pllr sss 
6601 tgtaacccac tcgtgcaccc aactgatctt cagcatcttt tactttcacc 
m-p trap n-s sas ftft 
6651 agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 

svs g-a ktgr'qna akk 
6701 aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat 

gira trk c-i lilf Ifq 
6751 attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt 
yy - siyq gyc Ims gyif 
6801 gaatgtattt agaaaaataa acaaataggg gttccgcgca catttccccg 

eci -.kn kqig vpr tfp 
6851 aaaagtgcca cctgacgtcg acggatcggg agatctcccg atcccctatg 

rkvp pdv dgs gdlp ipy 
6901 gtcgactctc agtacaatct gctctgatgc cgcatagtta agccagtatc 
grl svqs aim phs - asi 
6951 tgctccctgc ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt 

csl Ivc wrsl ss a rak 
7001 taagctacaa caaggcaagg cttgaccgac aattgcatga agaatctgct 

fklq qgk a-p tia - ri<c 
7051 tagggttagg cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc 
Igl-gvlr cfa myg pdir 
7101 gttgacattg attattgact agttattaat agtaatcaat tacggggtca 

vdi d-y - Ivin snq Irg 
7151 ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa 

h-fi ahi wss alhn Ir- 
7201 tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa 
mar ladr ptt pah - rq- 
7251 tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa 

-rm f'p - - rq - gls idv 
7301 tgggtggact atttacggta aactgcccac ttggcagtac atcaagtgta 

ngwt iyg kip twqy ike 
7351 tcatatgcca agtacgcccc ctattgacgt c SEQ ID NO 17 

iic qvrp lit SEQ ID NO 18 
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Figure 14 

First half affusion insert from Open Reading Frame Rv1979c 

21 Mar 2003 Feature Properties 

Molecule: DNA Fusion Vaccine New, 7 381 bps mK 

Circular 

9 

File Name: 197 9-17 69Fusion.cin5 dated 29 2002 

Description: Ligation of inverted Rvl769 3' New* xnto 
first step 

Details: 'Rvl979c' , Gene, 1362 to 1874 

hypothetical protein Rvl97 9c 
Translation product 171 aas 
Mol Wt 17539.0, Isoelectric Pt (pl^ 5.13/ 

Translation: 

WANGTSWGKAFALSAFIVGGLWIITTQHVNNYATAWSAYSATPYSLLGVA 
EIGKGTFSSMAIATIVALYAFTGFESIANAAEEMDAPDRNLPRAIPIAIF 

SVGAI YLLT LTVAMLLGSNKIAAS DDTVKLAAAI GNAT FRT 1 1 WGAL I S 
MFGINVAASFGAPRLWTALAD' SEQ ID NO 19 

Position within the ORF of the segment to be fused in the vaccine (BOLD) 

M tabe«:ulis bacteriaIRvl979c!Rvl979c: 481 aa - POSSIBLE CONSERVED PERMEASE 

L : =S5 =SL ?5?= i= =i 

121 - AFPDKAWADA EQLFSVKTLT FLGFIGVLLA INLFGNRAIK y^^^fA!^^ VlmJ^TMS 

III - SwxxT^^ SSSS^S SISS 

foi : ^X?^? SSS^iS ioSGVI^PTKL SBKNQYBVPM VSFAIT^^^ 

ll\ - SSSSJ ^TGLAVI AKFVQFIIVP lALIAl^SQ AVEHAAV^ 
^21 - SwSVGIAV SYDYRCIFLV RGGPNYFSIA LIVITFVWP AMAYLHYYRl IRRVGDRPST 
481 - R SEQ ID NO 20 
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Figure 15 



Second half of fusion insert from Open Reading Frame Rv1769 



27 Mar 2003 



Feature Properties 



Molecule; 
Circular 

File. Name : 
Description 
first step 

Details : 



DNA Fusion Vaccine New, 7381 bps DNA 



1979-1769Fusion.cm5, dated 29 Jul 2002 
Ligation of inverted Rvl7 69 3' New* into 



'Rvl769'/ Gene, 1890 to 2402 

hypothetical protein Rvl7 69 

Translation product 171 aas 

Mol Wt 19105.2, Isoelectric Pt (pi) 4.78 



Translation: 

VHEViWlEQRSDGPMRLDAQGRLQRYEEAFADYDAPFAFVDLDAMWGNADQ 

LLARAGDKPIRVASKSLRCRPLQREILDASERFDGLLTFTLTETLWLiAGQ 

GFSNLLLAYPPTDRAALRALGELTAKDPDGAPIVMVDSVEHLDLIERTTD 



KPVRLC LD FDAGYWRAGGR I K 



SEQ ID NO 21 



Position witliin tfte ORF of the segment to be fused in the vaccine (BOLD) 

>M. tuberculis bacterialRvl769|Rvl769: 414 aa - CONSERVED HYPOTHETICAL PROTEIN 

SEQ ID NO 22 

1 - VHEVAAREQR SDGPMRLDAQ GRiQRYEEAF ADYDAPFAFV DLDAMWGNAD QIiIARACSDKP 
61 - IRVASKSUIC RPLQREILDA SERFDGLLTF TLTETLWLAG QGFSNUJLAY PPTDRAALRA 
121 - LGELTAKDPD GAPIVMVDSV EHIiDLIERTT DKPVRXiCLDF DAGYWRAGGR IKIGSKRSPL 
181 - HTPEQARAIA VEIARRPALT LAAIJMCYEAH IAGLGDNVAG KRVHNAIIRR MQRMSFEELR 
241 - ERRARAVELV REVADIKIVN AGGTGDLQLV AQEPLITEAT AGSGFYAPTL FDSYSTFTLQ 
301 - PAAMFALPVC RRPGAKTVTA LGGGYLASGV GAKDRMPTPY LPVGLKLNAL EGTGEVQTPL 
361 - SGDAARRLKL GDKVYFRHTK AGELCERFDH LHLVRGAEW DTVPTYRGEG RTFL 
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Box I Observations where certain claims were found unsearchable (Continuation of Hem 1 of first sheet) 

This international Search Report has not been established in respect of certain claims under Article ^ 7(2)(a) for the following reasons: 



1. I X I Claims Nos.: 



because they relate to subject matter not required to be searched by this Authority, namely: 

see FURTHER INFORMATION sheet PCT/ISA/210 



^' ^ bSe ?hey relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



^' O Kse Sfey are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity ot invention is lacki ng (Continuation of item 2 of first s heet) 

This International Searching Authority found multiple Inventions in this international application, as follows: 



see additional sheet 



1 . rn AS ail required additional search lee? were timely paid by the applicant, this International Search Report covers all 
' — 1 searchable claims- 



2. □ AS all searchable claims could be searched without effort justifying an additional lee. this Authority did not invite payment 
— of any additional fee. 



3 n AS only some of the required additional search fees were timely paid by the applicant, this international Search Report 
I I covers only those claims lor which fees were paid, specifically claims Nos.: 



4 [71 No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
^ restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest 



I j The additional search fees were accompanied by the applicant's protest. 
[ I No protest accompanied the payment of additional search fees. 
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FURTHER INFORMATION CONTINUED FROM PCTASAf 210 



Continuation of Box I.l 

Although clalm(s) 17, 18 are directed to a diagnostic method practised 
on the human/animal body, the search has been carried out and based on 
the alleged effects of the compound/composition. 



Although claims 32, 33 are directed to a method of treatment of the 
human/animal body, the search has been carried out and based on the 
alleged effects of the compound/composition. 

The applicant's attention is drawn to the fact that claims relating to 
inventions in respect of which no international search report has been 
established need not be the subject of an International preliminary 
examination (Rule 66.1(e) PCT). The applicant is advised that the EPO 
policy when acting as an International Preliminary Examining Authority is 
normally not to carry out a preliminary examination on matter which has 
not been searched. This is the case irrespective of whether or not the 
claims are amended following receipt of the search report or during any 
Chapter II procedure. If the application proceeds into the regional phase 
before the EPO, the applicant is reminded that a search may be carried 
out during examination before the EPO (see EPO Guideline C-VI, 8.5), 
should the problems which led to the Article 17(2) declaration be 
overcome. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. claims: 1, 2. 13-21. 26-28. 31-33 (all part) 

Rvl986 protein and nucleic acids (Seq.ID Nos. 1, 8, 61). 
diagnostic reagents / kits comprising said Proteinls), 
nucleic acids encoding said diagniostic reagents, methods 
for determining M. tuberculosis infection. Vaccines 
comprising the polypeptides or therefore encoding nucleic 
acids and vaccination methods with either. 

2. claims: 7. 8 and 1-3. 13-19, 26-28. 31-33 (all part) 

RV3878 protein and nucleic acids (Seq.ID Nos. 3, IG. 63 (& 
?^agments 35-47)). diagnostic reagents / kits comprising 
said protein(s), nucleic acids encoding said diagniostic 
reagents, methods for determining M. tuberculosis infection. 
Vaccines comprising the polypeptides or therefore encoding 
nucleic acids and vaccination methods with either. 



3. claims: 1. 2, 13-21. 26-28. 31-33 (all part) 

Rvl983 protein and nucleic acids (Seq.ID Nos. 4, 11, 64). 
diagnostic reagents / kits comprising said protein(s). 
nucleic actds encoding said diagniostic reagents, methods 
for determining M. tuberculosis infection. Vaccines 
comprising the polypeptides or therefore encoding nucleic 
acids and vaccination methods with either. 

4. claims: 1-3. 13-19*, 26-28, 31-33 (all part) 

RV3873 protein and nucleic acids (Seq.ID Nos. 5, 12, 65 (& 
fragments 7. 23-34)). diagnostic reagents / kits comprising 
said protein(s). nucleic acids encoding said diagniostic 
reagents, methods for determining M. tuberculosis infection, 
vaccines comprising the polypeptides or therefore encoding 
nucleic acids and vaccination methods with either. 

5. claims: 4-6. 9-12 and 1-3. 13-19. 25-28. 31-33 (all part) 

Rv3879c protein and nucleic acids (Seq.ID Nos. 5, 13, 66 (& 
fragments 48-58)). diagnostic reagents / kits comprising 
said protein(s), nucleic acids encoding said diagniostic 
reagents, methods for determining M. tuberculosis infection. 
Vaccines comprising the polypeptides or therefore encoding 
nucleic acids and vaccination methods with either. 
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6. claims: 19. 26-28, 31-33 (all part) 

Rv3872 protein and nucleic acids (Seq.ID Nos. 2, 9, 62). 
diagnostic reagents comprising said protein(s), nucleic 
acids encoding said diagniostic reagents, methods for 
determining M. tuberculosis infection. Vaccines comprising 
the polypeptides or therefore encoding nucleic acids and 
vaccination methods with either. 



7. claims: 20-33 (all part) 



Rvl979c protein and nucleic acids (Seq.ID Nos. 14, 59, 7G) 
diagnostic reagents comprising said protein(s), nucleic 
acids encoding said diagniostic reagents, methods for 
determining M. tuberculosis infection. Vaccines comprising 
the polypeptides or therefore encoding nucleic acids and 
vaccination methods with either. Further, fusion proteins 
of Rvl979c and Rvl769. 



8.. claims: 20-23, 26-33 (all part) 



Rvl769 protein and nucleic acids (Seq.ID Nos. 15, 60, 71). 
diagnostic reagents comprising said protein(s), nucleic 
acids encoding said diagniostic reagents, methods for 
determining M. tuberculosis infection, Vaccines comprising 
the polypeptides or therefore encoding nucleic acids and 
vaccination methods with either. 
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